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Preface 


EYYUC 5’ fjv teXeoc’ o Se to xpixov fjxs /[apa^E- 
aijv xOi 8’ "dcivaxov xal xfj[po( pEXaivav 


Hesiod (?), Ehoiai, fr. 76.21-2 Merkelbach and West 

The ternary Goldbach conjecture (or three-prime conjecture) states that every odd 
number n greater than 5 can be written as the sum of three primes. The purpose of this 
book is to give the first full proof of this conjecture. 

The proof builds on the great advances made in the early 20th century by Hardy and 
Littlewood (1922) and Vinogradov (1937). Progress since then has been more gradual. 
In some ways, it was necessary to clear the board and start work using only the main 
existing ideas towards the problem, together with techniques developed elsewhere. 

Part of the aim has been to keep the exposition as accessible as possible, with 
an emphasis on qualitative improvements and new technical ideas that should be of 
use elsewhere. The main strategy was to give an analytic approach that is efficient, 
relatively clean, and, as it must be for this problem, explicit; the focus does not lie in 
optimizing explicit constants, or in performing calculations, necessary as these tasks 
are. 

Organization. In the introduction, after a summary of the history of the problem, 
we will go over a detailed outline of the proof. The rest of the book is divided in three 
parts, structured so that they can be read independently; the first two parts do not refer 
to each other, and the third part uses only the main results (clearly marked) of the first 
two parts. 

As is the case in most proofs involving the circle method, the problem is reduced to 
showing that a certain integral over the “circle” M/Z is non-zero. The circle is divided 
into major arcs and minor arcs. In Part|^- in some ways the technical heart of the proof 
- we will see how to give upper bounds on the integrand when a is in the minor arcs. 
Part [n| will provide rather precise estimates for the integrand when the variable a is in 
the major arcs. Lastly, Part m shows how to use these inputs as well as possible to 
estimate the integral. 

Each part and each chapter starts with a general discussion of the strategy and 
the main ideas involved. Some of the more technical bounds and computations are 
relegated to the appendices. 
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Chapter 1 

Introduction 


The question we will discuss, or one similar to it, seems to have been first posed by 
Descartes, in a manuscript published only centuries after his death BDesOSI p. 298]. 
Descartes states: “Sed & omnis numerus par fit ex uno vel duobus vel tribus primis” 
(“But also every even number is made out of one, two or three prime numbers.’^) This 
statement comes in the middle of a discussion of sums of polygonal numbers, such as 
the squares. 

Statements on sums of primes and sums of values of polynomials (polygonal num¬ 
bers, powers n^, etc.) have since shown themselves to be much more than mere cu¬ 
riosities - and not just because they are often very difficult to prove. Whereas the study 
of sums of powers can rely on their algebraic structure, the study of sums of primes 
leads to the realization that, from several perspectives, the set of primes behaves much 
like the set of integers, or like a random set of integers. (It also leads to the realization 
that this is very hard to prove.) 

If, instead of the primes, we had a random set of odd integers S whose density - 
an intuitive concept that can be made precise - equaled that of the primes, then we 
would expect to be able to write every odd number as a sum of three elements of S, 
and every even number as the sum of two elements of S. We would have to check by 
hand whether this is true for small odd and even numbers, but it is relatively easy to 
show that, after a long enough check, it would be very unlikely that there would be any 
exceptions left among the infinitely many cases left to check. 

The question, then, is in what sense we need the primes to be like a random set of 
integers; in other words, we need to know what we can prove about the regularities of 
the distribution of the primes. This is one of the main questions of analytic number 
theory; progress on it has been very slow and difficult. 

Fourier analysis expresses information on the distribution of a sequence in terms 
of frequencies. In the case of the primes, what may be called the main frequencies - 
those in the major arcs - correspond to the same kind of large-scale distribution that 
is encoded by L-functions, the family of functions to which the Riemann zeta function 

'Thanks are due to J. Brandes and R. Vaughan for a discussion on a possible ambiguity in the Latin 
wording. Descartes’ statement is mentioned (with a translation much like the one given here) in Dickson’s 
History IDic66l Ch. XVIII]. 
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belongs. On some of the crucial questions on L-functions, the limits of our knowledge 
have barely budged in the last century. There is something relatively new now, namely, 
rigorous numerical data of non-negligible scope; still, such data is, by definition, finite, 
and, as a consequence, its range of applicability is very narrow. Thus, the real question 
in the major-arc regime is how to use well the limited information we do have on the 
large-scale distribution of the primes. As we will see, this requires delicate work on 
explicit asymptotic analysis and smoothing functions. 

Outside the main frequencies - that is, in what are called the minor arcs - estimates 
based on T-functions no longer apply, and what is remarkable is that one can say 
anything meaningful on the distribution of the primes. Vinogradov was the first to give 
unconditional, non-trivial bounds, showing that there are no great irregularities in the 
minor arcs; this is what makes them “minor”. Here the task is to give sharper bounds 
than Vinogradov. It is in this regime that we can genuinely say that we learn a little 
more about the distribution of the primes, based on what is essentially an elementary 
and highly optimized analytic-combinatorial analysis of exponential sums, i.e., Fourier 
coefficients given by series (supported on the primes, in our case). 

The circle method reduces an additive problem - that is, a problems on sums, such 
as sums of primes, powers, etc. - to the estimation of an integral on the space of 
frequencies (the “circle” K/Z). In the case of the primes, as we have just discussed, we 
have precise estimates on the integrand on part of the circle (the major arcs), and upper 
bounds on the rest of the circle (the minor arcs). Putting them together efficiently to 
give an estimate on the integral is a delicate matter; we leave it for the last part, as it 
is really what is particular to our problem, as opposed to being of immediate general 
relevance to the study of the primes. As we shall see, estimating the integral well does 
involve using - and improving - general estimates on the variance of irregularities in 
the distribution of the primes, as given by the large sieve. 

In fact, one of the main general lessons of the proof is that there is a very close 
relationship between the circle method and the large sieve; we will use the large sieve 
not just as a tool - which we shall, incidentally, sharpen in certain contexts - but as a 
source for ideas on how to apply the circle method more effectively. 

This has been an attempt at a first look from above. Let us now undertake a more 
leisurely and detailed overview of the problem and its solution. 


1.1 History and new developments 

The history of the conjecture starts properly with Euler and his close friend, Christian 
Goldbach, both of whom lived and worked in Russia at the time of their correspon¬ 
dence - about a century after Descartes’ isolated statement. Goldbach, a man of many 
interests, is usually classed as a serious amateur; he seems to have awakened Euler’s 
passion for number theory, which would lead to the beginning of the modern era of 
the subject 0Wei84l Ch. 3, §IV]. In a letter dated June 7, 1742, Goldbach made a 
conjectural statement on prime numbers, and Euler rapidly reduced it to the following 
conjecture, which, he said, Goldbach had already posed to him; every positive integer 
can be written as the sum of at most three prime numbers. 
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We would now say “every integer greater than 1”, since we no long consider 1 to 
be a prime number. Moreover, the conjecture is nowadays split into two; 

• the weak, or ternary, Goldbach conjecture states that every odd integer greater 
than 5 can be written as the sum of three primes; 

• the strong, or binary, Goldbach conjecture states that every even integer greater 
than 2 can be written as the sum of two primes. 

As their names indicate, the strong conjecture implies the weak one (easily: subtract 3 
from your odd number n, then express n — 3 as the sum of two primes). 

The strong conjecture remains out of reach. A short while ago - the first complete 
version appeared on May 13, 2013 - the author proved the weak Goldbach conjecture. 

Theorem 1.1.1. Every odd integer greater than 5 can be written as the sum of three 
primes. 

In 1937, I. M. Vinogradov proved IIVin37l that the conjecture is true for all odd 
numbers n larger than some constant C. (Hardy and Littlewood had proved the same 
statement under the assumption of the Generalized Riemann Hypothesis, which we 
shall have the chance to discuss later.) 

It is clear that a computation can verify the conjecture only for n < c, ca constant: 
computations have to be finite. What can make a result coming from analytic number 
theory be valid only for n > Cl 

An analytic proof, generally speaking, gives us more than just existence. In this 
kind of problem, it gives us more than the possibility of doing something (here, writing 
an integer n as the sum of three primes). It gives us a rigorous estimate for the number 
of ways in which this something is possible; that is, it shows us that this number of 
ways equals 

main term + error term, (1 ■ 1) 

where the main term is a precise quantity f{n), and the error term is something whose 
absolute value is at most another precise quantity g{n). If /(n) > g{n), then 113 is 
non-zero, i.e., we will have shown the existence of a way to write our number as the 
sum of three primes. 

(Since what we truly care about is existence, we are free to weigh different ways 
of writing n as the sum of three primes however we wish - that is, we can decide that 
some primes “count” twice or thrice as much as others, and that some do not count at 
all.) 

Typically, after much work, we succeed in obtaining ([U} with /(n) and g{n) such 
that f{n) > g{n) asymptotically, that is, for n large enough. To give a highly simplified 
example: if, say, f{n) = and g{n) = l OOn^^^, then /(n) > g{n) forn > C, where 
C = lO'^, and so the number of ways is positive for n > C. 

We want a moderate value of C, that is, a C small enough that all cases n < C can 
be checked computationally. To ensure this, we must make the error term bound g{n) 
as small as possible. This is our main task. A secondary (and sometimes neglected) 
possibility is to rig the weights so as to make the main term /(n) larger in comparison 
to g{n)-, this can generally be done only up to a certain point, but is nonetheless very 
helpful. 
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As we said, the first unconditional proof that odd numbers n > C can be written 
as the sum of three primes is due to Vinogradov. Analytic bounds fall into several 
categories, or stages; quite often, successive versions of the same theorem will go 
through successive stages. 

1. An ineffective result shows that a statement is true for some constant C, but gives 
no way to determine what the constant C might be. Vinogradov’s first proof of 
his theorem (in 0Vin37ll l is like this: it shows that there exists a constant C such 
that every odd number n > C is the sum of three primes, yet give us no hope of 
finding out what the constant C might be0 Many proofs of Vinogradov’s result 
in textbooks are also of this type. 

2. An effective, but not explicit, result shows that a statement is true for some 
unspecified constant C in a way that makes it clear that a constant C could 
in principle be determined following and reworking the proof with great care. 
Vinogradov’s later proof f llVin47l . translated in mm ) is of this nature. As 
Chudakov IIChu471 §IV.2] pointed out, the improvement on 0Vin37ll given ^ 
Mardzhanishvili MMar41ll already had the effect of making the result effective^ 

3. An explicit result gives a value of C. According to IIChu47l p. 201], the first 
explicit version of Vinogradov’s result was given by Borozdkin in his unpub¬ 
lished doctoral dissertation, written under the direction of Vinogradov (1939): 
C = exp(exp(exp(41.96))). Such a result is, by definition, also effective. 
Borodzkin later IIBor56l gave the value C = e® , though he does not seem to 
have published the proof. The best - that is, smallest - value of C known before 
the present work was that of Liu and Wang OLW02I : C = 2 ■ 

4. What we may call an efficient proof gives a reasonable value for C - in our case, 
a value small enough that checking all cases up to C is feasible. 

How far were we from an efficient proof? That is, what sort of computation could 
ever be feasible? The situation was paradoxical: the conjecture was known above an 
explicit C, but C = 2 -is so large that it could not be said that the problem could 
be attacked by any foreseeable computational means within our physical universe. (A 
truly brute-force verification up to C takes at least C steps; a cleverer verification takes 
well over y/C steps. The number of picoseconds since the beginning of the universe is 
less than 10^°, whereas the number of protons in the observable universe is currently 
estimated at ^ 10®° IIShu92ll ; this limits the number of steps that can be taken in 
any currently imaginable computer, even if it were to do parallel processing on an 
astronomical scale.) Thus, the only way forward was a series of drastic improvements 
in the mathematical, rather than computational, side. 

I gave a proof with C = 10^° in May 2013. Since D. Platt and I had verified 
the conjecture for all odd numbers up to n < 8.8 • 10®° by computer 0HP131 . this 
established the conjecture for all odd numbers n. 

^Here, as is often the case in ineffective results in analytic number theory, the underlying issue is that of 
Siegel zeros, which are believed not to exist, but have not been shown not to; the strongest bounds on (i.e., 
against) such zeros are ineffective, and so are all of the many results using such estimates. 

^The proof in IMar411 combined the bounds in IVin371 with a more careful accounting of the effect of 
the single possible Siegel zero within range. 
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(In December 2013, I reduced C to 10^^. The verification of the ternary Gold- 
bach conjecture up to n < can be done on a home computer over a weekend, 
as of the time of writing (2014). It must be said that this uses the verification of the 
binary Goldbach conjecture for n < 4 • 10^® IIOeSHP141 . which itself required com¬ 
putational resources far outside the home-computing range. Checking the conjecture 
up to n < 10^^ was not even the main computational task that needed to be accom¬ 
plished to establish the Main Theorem - that task was the finite verification of zeros of 
L-functions in IPlabl . a general-purpose computation that should be useful elsewhere.) 

What was the strategy of the proof? The basic framework is the one pioneered by 
Hardy and Littlewood for a variety of problems - namely, the circle method, which, as 
we shall see, is an application of Fourier analysis over Z. (There are other, later routes 
to Vinogradov’s result; see IIHB85I . IIFI98I and especially the recent work 0Shal4L 
which avoids using anything about zeros of T-functions inside the critical strip.) Vino¬ 
gradov’s proof, like much of the later work on the subject, was based on a detailed 
analysis of exponential sums, i.e., Fourier transforms over Z. So is the proof that we 
will sketch. 

At the same time, the distance between 2 • 10^®"^® and 10^^ is such that we cannot 
hope to get to 10^^ (or any other reasonable constant) by fine-tuning previous work. 
Rather, we must work from scratch, using the basic outline in Vinogradov’s original 
proof and other, initially unrelated, developments in analysis and number theory (no¬ 
tably, the large sieve). Merely improving constants will not do; rather, we must do 
qualitatively better than previous work (by non-constant factors) if we are to have any 
chance to succeed. It is on these qualitative improvements that we will focus. 

* * * 

It is only fair to review some of the progress made between Vinogradov’s time and 
ours. Here we will focus on results; later, we will discuss some of the progress made 
in the techniques of proof. See IIDic66l Ch. XVIII] for the early history of the problem 
(before Hardy and Littlewood); see R. Vaughan’s ICM lecture notes on the ternary 
Goldbach problem IIVau80l for some further details on the history up to 1978. 

In 1933, Schnirelmann proved flSch33l that every integer n > 1 can be written as 
the sum of at most K primes for some unspecified constant K. (This pioneering work 
is now considered to be part of the early history of additive combinatorics.) In 1969, 
Klimov gave an explicit value for K (namely, K = 6 ■ 10®); he later improved the 
constant to K = 115 (with G. Z. Piltay and T. A. Sheptickaja) and K = 55. Later, 
there were results by Vaughan 0Vau77al {K = 27), Deshouillers IIDes77l {K = 26) 
and Riesel-Vaughan ||W8^ {K = 19). 

Ramare showed in 1995 that every even number n > 1 can be written as the sum of 
at most 6 primes IIRam95ll . In 2012, Tao proved IITaol41 that every odd number n > 1 
is the sum of at most 5 primes. 

There have been other avenues of attack towards the strong conjecture. Using ideas 
close to those of Vinogradov’s, Chudakov IIChu37l . IIChu38l . Estermann flEst37l and 
van der Corput Ilvan37ll proved (independently from each other) that almost every even 
number (meaning: all elements of a subset of density 1 in the even numbers) can be 
written as the sum of two primes. In 1973, J.-R. Chen showed 0Che73l that every even 
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number n larger than a constant C can be written as the sum of a prime number and 
the product of at most two primes (n = pi + p 2 ov n — pi + P 2 P 3 )- Incidentally, 
J.-R. Chen himself, together with T.-Z. Wang, was responsible for the best bounds on 
C (for ternary Goldbach) before Lui and Wang: C = exp(exp(11.503)) < 4 • 

IICW89I and C = exp(exp(9.715)) < 6 • ||cw%l . 

Matters are different if one assumes the Generalized Riemann Hypothesis (GRH). 
A careful analysis IIEff99l of Hardy and Littlewood’s work 0HL221 gives that every 
odd number n > 1.24 • 10^° is the sum of three primes if GRH is tru^ According 
to IIEff99l . the same statement with n > 10^^ was proven in the unpublished doctoral 
dissertation of B. Lucke, a student of E. Landau’s, in 1926. Zinoviev IIZin97l improved 
this to n > 10^°. A computer check ( ||DEtRZ97| ; see also IISao98l ) showed that the 
conjecture is true for n < 10^°, thus completing the proof of the ternary Goldbach 
conjecture under the assumption of GRH. What was open until now was, of course, the 
problem of giving an unconditional proof. 


1.2 The circle method: Fourier analysis on Z 

It is common for a hrst course on Eourier analysis to focus on functions over the re¬ 
als satisfying f{x) = f{x + 1), or, what is the same, functions / : K/Z —>■ C. 
Such a function (unless it is fairly pathological) has a Eourier series converging to it; 
this is just the same as saying that / has a Eourier transform / : Z — C dehned 
by f{n) = J^/^f{a)e{-an)da and satisfying /(a) = E„gz (Fourier 
inversion theorem), where e{t) = 

In number theory, we are especially interested in functions / : Z — C. Then things 
are exactly the other way around: provided that / decays reasonably fast as n —±oo 
(or becomes 0 for n large enough), / has a Eourier transform / : K/Z — C defined 
by /(a) = satisfying /(n) = f(a)e{an)da. (Highbrow 

talk: we already knew that Z is the Eourier dual of M/Z, and so, of course, K/Z is 
the Eourier dual of Z.) “Exponential sums” (or “trigonometrical sums”, as in the title 
of IIVin54ll ') are sums of the form of course, the “circle” in “circle 

method” is just a name for K/Z. (To see an actual circle in the complex plane, look at 
the image of K/Z under the map a i—>■ e(a).) 

The study of the Eourier transform / is relevant to additive problems in number 
theory, i.e., questions on the number of ways of writing n as a sum of k integers of 
a particular form. Why? One answer could be that / gives us information about the 
“randomness” of /; if / were the characteristic function of a random set, then /(a) 
would be very small outside a sharp peak at a = 0. 

We can also give a more concrete and immediate answer. Recall that, in general, 
the Eourier transform of a convolution equals the product of the transforms; over Z, 


'*In fact, Hardy, Littlewood and Effinger use an assumption somewhat weaker than GRH: they assume 
that Dirichlet L-functions have no zeroes satisfying 5R(s) > 6, where 0 < 3/4 is arbitrary. (We will review 
Dirichlet L-functions in a minute.) 
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this means that for the additive convolution 

{f*9)in)= fi'mi)9{m2), 

mi ,m2GZ 
mi-\-m2—rL 

the Fourier transform satisfies the simple rule 

f*9{a) = f{a) -gia). 

We can see right away from this that (/ * g){n) can be non-zero only if n can be 
written as n = mi -I- m2 for some mi, m2 such that /(mi) and g{m 2 ) are non-zero. 
Similarly, {f * g*h){n) can be non-zero only if n can be written as n = mi -|-m2 -l-ma 
for some mi, m2, m3 such that /(mi), /2(m2) and are all non-zero. This 

suggests that, to study the ternary Goldbach problem, we define fi, f2, f3 : Z ^ C so 
that they take non-zero values only at the primes. 

Hardy and Littlewood defined /i(n) = f 2 {n) = fsin) = 0 for n non-prime (and 
also for n < 0), and /i(n) = f2{n) = fsin) = (logn)e“"/“ for n prime (where x is 
a parameter to be fixed later). Here the factor is there to provide “fast decay”, 

so that everything converges; as we will see later. Hardy and Littlewood’s choice of 
g-n/a; (j-^ijjer than some other function of fast decay) comes across in hindsight as 
being very clever, though not quite best-possible. (Their “choice” was, to some extent, 
not a choice, but an artifact of their version of the circle method, which was framed 
in terms of power series, not in terms of exponential sums with arbitrary smoothing 
functions.) The term logn is there for technical reasons - in essence, it makes sense 
to put it there because a random integer around n has a chance of about l/(logn) of 
being prime. 

We can see that (/i * f 2 * /a )("«■) 7^ 0 if and only if n can be written as the sum 
of three primes. Our task is then to show that (/i * f 2 * f 3 )i'n) (i.e., (/ * / * f )iji)) 
is non-zero for every n larger than a constant C ~ 10^^. Since the transform of a 
convolution equals a product of transforms, 

(/i*/ 2 */ 3 )(n) = / fi * f2 * f3{a)e{an)da = f {fif2f3)ia)e{an)da. ( 1 . 2 ) 


Our task is thus to show that the integral J.g^^.^{fi f 2 f 3 ){a)e{an)da is non-zero. 

As it happens, f{a) is particularly large when a is close to a rational with small 
denominator. Moreover, for such a, it turns out we can actually give rather precise 
estimates for f{a). Define 3Jl (called the set of major arcs) to be a union of narrow 
arcs around the nationals with small denominator: 



(a,q) = l 


la 1 \ 
qQ' q qQJ ’ 


where Q is a constant times x/r, and r will be set later. (This is a slight simplification: 
the major-arc set we will actually use in the course of the proof will be a little different. 
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due to a distinction between odd and even q.) We can write 

f (7i/2^)(a)e(an)da = [ ifif2f3)ioi)e{an)da + f {fif2f3)ia)e{an)da, 

«/R/Z t /Jxn 

(1.3) 

where m is the complement (K/Z) \ 97t (called minor arcs). 

Now, we simply do not know how to give precise estimates for /(a) when a is in 
m. However, as Vinogradov realized, one can give reasonable upper bounds on |/(a)| 
for a G m. This suggests the following strategy; show that 


/ I/i (a) 11/ 2 (a) 11/a (a) Ma < / /i(a)/2(Q;)/3(a)e(an)da. 

/m Jm 


(1.4) 


By (1.2 1 and (1.3 1 , this will imply immediately that (/i * f 2 * f 3 )in) > 0, and so we 


will be done. 

The name of circle method is given to the study of additive problems by means of 
Fourier analysis over Z, and, in particular, to the use of a subdivision of the circle K/Z 
into major and minor arcs to estimate the integral of a Fourier transform. There was 
a “circle” already in Hardy and Ramanujan’s work IHROOl . but the subdivision into 
major and minor arcs is due to Hardy and Littlewood, who also applied their method 
to a wide variety of additive problems. (Hence “the Hardy-Littlewood method” as an 
alternative name for the circle method.) For instance, before working on the ternary 
Goldbach conjecture, they studied the question of whether every n > C can be written 
as the sum of fcth powers (Waring’s problem). In fact, they used a subdivision into 
major and minor arcs to study Waring’s problem, and not for the ternary Goldbach 
problem: they had no minor-arc bounds for ternary Goldbach, and their use of GRH 
had the effect of making every a G M/Z yield to a major-arc treatment. 

Vinogradov worked with finite exponential sums, i.e., fi compactly supported. 
From today’s perspective, it is clear that there are applications (such as ours) in which 
it can be more important for fi to be smooth than compactly supported; still, Vino¬ 
gradov’s simplifications were an incentive to further developments. In the case of the 
ternary Goldbach’s problem, his key contribution consisted in the fact that he could 
give bounds on / (a) for a in the minor arcs without using GRH. 

An important note; in the case of the binary Goldbach conjecture, the method fails 
at (1.4 1 , and not before; if our understanding of the actual value of fi (a) is at all correct, 
it is simply not true in general that 


|/i(a)||/ 2 (Q!)MQ! < / fi{a)f 2 {a)e{an)da. 


/m 


Let us see why this is not surprising. Set fi = f 2 = fs = f for simplicity, so that 
we have the integral of the square (/(a))^ for the binary problem, and the integral of 
the cube (/(a))^ for the ternary problem. Squaring, like cubing, amplifies the peaks 
of f{a), which are at the rationals of small denominator and their immediate neighbor¬ 
hoods (the major arcs); however, cubing amplifies the peaks much more than squaring. 
This is why, even though the arcs making up 9J1 are very narrow, J^{f (a))^e{an)da 
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is larger than \ f{a)\^da', that explains the name major arcs - they are not large, but 
they give the major part of the contribution. In contrast, squaring amplifies the peaks 
less, and this is why the absolute value of f(a)^e(an)da is in general smaller than 
fm nobody knows how to prove a precise estimate (and, in particular, 

lower bounds) on f(a) for a € m, the binary Goldbach conjecture is still very much 
out of reach. 

To prove the ternary Goldbach conjecture, it is enough to estimate both sides of 
(|1.4|i for carefully chosen /i, / 2 , /a, and compare them. This is our task from now on. 


1.3 The major arcs Wl 

1.3.1 What do we really know about L-functions and their zeros? 

Before we start, let us give a very brief review of basic analytic number theory (in the 
sense of, say, IIDav671 1. A Dirichlet character x : Z —C of modulus g is a character 
of {jLjqLY lifted to Z. (In other words, x(n) = x(^^ + d) for all n, x(a^) = x(a)x(^) 
for all a, b and x(n) = 0 for (n, q) Y !■) A Dirichlet L-series is dehned by 

OO 

L{s,x) = X! 

n— 1 

for 3?(s) > 1, and by analytic continuation for 3fi(s) < 1. (The Riemann zeta function 
C(s) is the L-function for the trivial character, i.e., the character x such that xijt) = 1 
for all n.) Taking logarithms and then derivatives, we see that 

- TT^ = 51 x(n)A(n)n"", (1.5) 

A(s,x) 

for 3?(s) > 1, where A is the von Mangoldt function (A(n) = logp if n is some prime 
power p“, a > 1, and A(n) = 0 otherwise). 

Dirichlet introduced his characters and L-series so as to study primes in arithmetic 
progressions. In general, and after some work, ([T3]i allows us to restate many sums 
over the primes (such as our Fourier transforms /(a)) as sums over the zeros of L(s, x)- 
A non-trivial zero of L(s, x) is a zero of L(s, x) such that 0 < 3fi(s) < 1. (The other 
zeros are called trivial because we know where they are, namely, at negative integers 
and, in some cases, also on the line 3fi(s) = 0. In order to eliminate all zeros on 
5ft(s) = 0 outside s = 0, it suffices to assume that x is primitive', a primitive character 
modulo q is one that is not induced by (i.e., not the restriction of) any character modulo 
d\q, d < q.) 

The Generalized Riemann Hypothesis for Dirichlet L-functions is the statement 
that, for every Dirichlet character x. every non-trivial zero of L(s, x) satisfies 3?(s) = 
1/2. Of course, the Generalized Riemann Hypothesis (GRH) - and the Riemann Hy¬ 
pothesis, which is the special case of x trivial - remains unproven. Thus, if we want to 
prove unconditional statements, we need to make do with partial results towards GRH. 
Two kinds of such results have been proven: 
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• Zero-free regions. Ever since the late nineteenth century (Hadamard, de la 
Vallee-Poussin) we have known that there are hourglass-shaped regions (more 
precisely, of the shape < cr < 1 — where c is a constant and where we 
write s = a + it) outside which non-trivial zeros cannot lie. Explicit values for 
c are known ||McC84bl . BKad05L BKadl . There is also the Vinogradov-Korobov 
region IIKor58l . flVin58ll . which is broader asymptotically but narrower in most 
of the practical range (see BFor02l . however). 

• Finite verifications of GRH. It is possible to (ask a computer to) prove small, 
hnite fragments of GRH, in the sense of verifying that all non-trivial zeros of 
a given hnite set of L-functions with imaginary part less than some constant H 
lie on the critical line 3fi(s) = 1/2. Such verihcations go back to Riemann, 
who checked the hrst few zeros of C(s)- Large-scale, rigorous computer-based 
verihcations are now a possibility. 

Most work in the literature follows the hrst alternative, though flTaol4l did use a 
hnite verihcation of RH (i.e., GRH for the trivial character). Unfortunately, zero-free 
regions seem too narrow to be useful for the ternary Goldbach problem. Thus, we are 
left with the second alternative. 

In coordination with the present work, Platt IPlabll verihed that all zeros s of L- 
functions for characters x with modulus q < 300000 satisfying 3(s) < Hq lie on the 
line 3fi(s) = 1/2, where 

• Hq = 10®/g for q odd, and 

• Hq= max(10®/g, 200 -I- 7.5 • 10^/g) for q even. 

This was a medium-large computation, taking a few hundreds of thousands of core¬ 
hours on a parallel computer. It used interval arithmetic for the sake of rigor; we will 
later discuss what this means. 

The choice to use a hnite verihcation of GRH, rather than zero-free regions, had 
consequences on the manner in which the major and minor arcs had to be chosen. As 
we shall see, such a verihcation can be used to give very precise bounds on the major 
arcs, but also forces us to dehne them so that they are narrow and their number is 
constant. To be precise: the major arcs were dehned around rationals a/q with q < r, 
r = 300000; moreover, as will become clear, the fact that Hq is hnite will force their 
width to be bounded by c^r jqx, where cq is a constant (say cq = 8). 

1.3.2 Estimates of /(a) for a in the major arcs 

Recall that we want to estimate sums of the type f{a) = ^/(n)e(—cm), where 
f{n) is something like {\ogn)r]{n/x) for n equal to a prime, and 0 otherwise; here 
p : R —> C is some function of fast decay, such as Hardy and Littlewood’s choice. 



for f > 0, 
for t < 0. 
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Let us modify this just a little - we will actually estimate 


Sr,{a,x) = E A{n)e{an)r]{n/x), (1-6) 

where A is the von Mangoldt function (as in ( |1.5[ )) . The use of a rather than —a is just 
a bow to tradition, as is the use of the letter S (for “sum”); however, the use of A(n) 
rather than just plain logp does actually simplify matters. 

The function p here is sometimes called a smoothing function or simply a smooth¬ 
ing. It will indeed be helpful for it to be smooth on (0, oo), but, in principle, it need 
not even be continuous. (Vinogradov’s work implicitly uses, in effect, the “brutal trun¬ 
cation” l[o.i] (f), defined to be 1 when t G [0,1] and 0 otherwise; that would be fine for 
the minor arcs, but, as it will become clear, it is a bad idea as far as the major arcs are 
concerned.) 

Assume Of is on a major arc, meaning that we canwritea = a/g+5/a; for some a/g 
(g small) and some <5 (with |(5| small). We can write Sri{a, x) as a linear combination 



+ tiny error term, 


(1.7) 


where 


In (|1.7|l, V runs over primitive Dirichlet characters of moduli dig, and Cv is small 

Why are we expressing the sums Srj^a, x) in terms of the sums ^)’ which 

look more complicated? The argument has become 5/x, whereas before it was a. 
Here 6 is relatively small - smaller than the constant cqx, in our setup. In other words, 
e{6n/x) will go around the circle a bounded number of times as n goes from 1 up to a 
constant times x (by which time r]{n/x) has become small, because p is of fast decay). 
This makes the sums much easier to estimate. 

To estimate the sums we will use L-functions, together with one of the most 
common tools of analytic number theory, the Mellin transform. This transform is es¬ 
sentially a Laplace transform with a change of variables, and a Laplace transform, in 
turn, is a Fourier transform taken on a vertical line in the complex plane. For / of fast 
enough decay, the Mellin transform F = M/ of / is given by 


'^A{n)x{n)e{6n/x)r]{n/x). (1.8) 



df 

we can express / in terms of F by the Mellin inversion formula 


f{t) = 


^(7 + 200 


27ri 


F{s)t '^ds 


for any a within an interval. We can thus express e{6t)r]{t) in terms of its Mellin 
transform Fs and then use (1.5 1 to express Srj,x in terms of Fs and L'{s, x)/L{s, x); 
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shifting the integral in the Mellin inversion formula to the left, we obtain what is known 
in analytic number theory as an explicit formula: 

Sn^xi.^/x, x) = [r]{—6)x] — ^ Fs{p)x^ + tiny error term. 

p 

Here the term between brackets appears only for x trivial. In the sum, p goes over all 
non-trivial zeros of L(s, x), and Fs is the Mellin transform of e{5t)r]{f). (The tiny error 
term comes from a sum over the trivial zeros of L(s, x)-) We will obtain the estimate 
we desire if we manage to show that the sum over p is small. 

The point is this: if we verify GRH for L{s,x) up to imaginary part H, i.e., if 
we check that all zeroes p of L{s,x) with |9(p)| < H satisfy = 1/2, we have 
\xP\ = ^/x. In other words, x^ is very small (compared to x). However, for any 
p whose imaginary part has absolute value greater than H, we know next to nothing 
about its real part, other than 0 < 3?(p) < 1. (Zero-free regions are notoriously weak 
for 9 (/ 9 ) large; we will not use them.) Hence, our only chance is to make sure that 
Fs{p) is very small when |9(p)| > H. 

This has to be true for both 6 very small (including the case <1 = 0) and for 6 not so 
small (|()| up to corjq, which can be large because r is a large constant). How can we 
choose ry so that Fs{p) is very small in both cases for r = 9(p) large? 

The method of stationary phase is useful as an exploratory tool here. In brief, it 
suggests (and can sometimes prove) that the main contribution to the integral 

dt 

Fs{t) = / e{5t)p{t)C^ (1.9) 

can be found where the phase of the integrand has derivative 0. This happens when 
t = —tI2'x5 (for sgn(T) f sgn((5)); the contribution is then a moderate factor times 
rj{—T/2 'k6). In other words, if sgn(T) f sgn(5) and b is not too small (|()| > 8 , say), 
Fsip + it) behaves like rj{—T/2 'k5)', if <5 is small (|()| < 8 ), then Fg behaves like Fq, 
which is the Mellin transform Mr] of rj. Here is our goal, then: the decay of 77 (f) as 
|f| —> 00 should be as fast as possible, and the decay of the transform Mr]{a + ir) 
should also be as fast as possible. 

This is a classical dilemma, often called the uncertainty principle because it is the 
mathematical fact underlying the physical principle of the same name: you cannot have 
a function ry that decreases extremely rapidly and whose Fourier transform (or, in this 
case, its Mellin transform) also decays extremely rapidly. 

What does “extremely rapidly” mean here? It means (as Hardy himself proved) 
“faster than any exponential Thus, Hardy and Littlewood’s choice 77 (f) = 

seems essentially optimal at first sight. 

However, it is not optimal. We can choose rj so that Mr] decreases exponentially 
(with a constant C somewhat worse than for 7 y(f) = e“*), but ry decreases faster than 
exponentially. This is a particularly appealing possibility because it is t/\6\, and not so 
much f, that risks being fairly small. (To be explicit: say we check GRH for characters 
of modulus q up to Hq ~ 50 • c^r/q > 50|(5|. Then we only know that \t/2tt5\ > 
8 . So, for 77 (f) = e“‘, r]{—T/2Tr5) may be as large as e“®, which is not negligible. 
Indeed, since this term will be multiplied later by other terms, is simply not small 
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enough. On the other hand, we can assume that Hq > 200 (say), and so Mrj{s) ^ 
g-(’^/ 2 )|'r| completely negligible, and will remain negligible even if we replace tt/2 
by a somewhat smaller constant.) 

We shall take p(f) = (that is, the Gaussian). This is not the only possible 

choice, but it is in some sense natural. It is easy to show that the Mellin transform Fg 
for 77 (f) = is a multiple of what is called a parabolic cylinder function U(a, z) 

with imaginary values for z. There are plenty of estimates on parabolic cylinder func¬ 
tions in the literature - but mostly for a and z real, in part because that is one of the 
cases occuring most often in applications. There are some asymptotic expansions and 
estimates for (7(a, z), a, z, general, due to Olver 0Olv58l . 0Olv59l . Il01v61l . 0Olv65L 
but unfortunately they come without fully explicit error terms for a and z within our 
range of interest. (The same holds for IITV03I .1 

In the end, I derived bounds for Fs using the saddle-point method. (The method 
of stationary phase, which we used to choose 77 , seems to lead to error terms that are 
too large.) The saddle-point method consists, in brief, in changing the contour of an 
integral to be bounded (in this case, Jp|) so as to minimize the maximum of the 
integrand. (To use a metaphor in IldBSlI : find the lowest mountain pass.) 

Here we strive to get clean bounds, rather than the best possible constants. Consider 
the case fc = 0 of Corollary |8.0.2| with fc = 0; it states the following. For s = a -\- ir 
with cr € [0,1] and |r| > max(100, 47 r^| 5 |), we obtain that the Mellin transform Fs of 
ri{t)e{dt) with 77 (f) = satisfies 


1^5(5 -f fc)| -f |F 5((1 


if 4|r|/f2 < 3/2. ^ 

3.286e-°'i^®®l^l if4|r|/f2 > 3/2. 


Similar bounds hold for cr in other ranges, thus giving us estimates on the Mellin 
transform Fs for 77 (f) = f^'e“‘ and a in the critical range [0,1]. (We could do a little 
better if we knew the value of tr, but, in our applications, we do not, once we leave 


the range in which GRH has been checked. We will give a bound (Theorem 8.0.1 1 that 


does take a into account, and also reflects and takes advantage of the fact that there 
is a transitional region around |t| ^ (3/2)(7r/(5)^; in practice, however, we will use 
Cor. [ 8 : 0 : 21 ) 

A moment’s thought shows that we can also use ( jl.lOj l to deal with the Mellin 
transform of r]{t)e{St) for any function of the form 77 (f) = (or, more gener¬ 

ally, 77 (f) = f^e“* /^p(f)), where g(t) is any band-limited function. By a band-limited 
function, we could mean a function whose Fourier transform is compactly supported; 
while that is a plausible choice, it turns out to be better to work with functions that are 
band-limited with respect to the Mellin transform - in the sense of being of the form 


g{t) = / h{r)t "dr, 
J-R 


where fi : K C is supported on a compact interval [—i?, R], with R not too large (say 
R = 200). What happens is that the Mellin transform of the product e~* ^ ‘^g{t) e(6f) 
is a convolution of the Mellin transform Fs{s) of (estimated in (l.lOl) and 
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that of g{t) (supported in [—i?, i?]); the effect of the convolution is just to delay decay 
of Fs{s) by, at most, a shift hy y ^ y — R. 

We wish to estimate Srf^^{S/x) for several functions rj. This motivates us to derive 
an explicit formula (§) general enough to work with all the weights p(t) we will work 
with, while being also completely explicit, and free of any integrals that may be tedious 
to evaluate. 

Once that is done, and once we consider the input provided by Platt’s finite verifi¬ 
cation of GRH up to Hq, we obtain simple bounds for different weights. 

For 77 (f) = X > 10®, x a primitive character of modulus q < r = 300000, 

and any 5 S M with |5| < 4r/q, we obtain 





= Iq=i ■ r]{-6)x + E ■ X, 


where Iq=\ = 1 if g = 1 , Iq=i = 0 if g 7 ^ 1 , and 


\E\ < 4.306- 10"^^ 



/650400 

V 


112 


( 1 . 11 ) 


( 1 . 12 ) 


Here rj stands for the Fourier transform from K to R normalized as follows: 77(f) = 
e(—a:f)77(x)tfx. Thus, 77(— (5) is just ^ (self-duality of the Gaussian). 

This is one of the main results of Part|^ see §7.1| Similar bounds are also proven 
there for 77(f) = f^e“* as well as for a weight of type 77(f) = fe“* where 

g{t) is a band-limited function, and also for a weight 77 defined by a multiplicative 
convolution. The conditions on g (namely, g < r = 300000) and S are what we 
expected from the outset. 

Thus concludes our treatment of the major arcs. This is arguably the easiest part of 
the proof; it was actually what I left for the end, as I was fairly confident it would work 
out. Minor-arc estimates are more delicate; let us now examine them. 


1.4 The minor arcs m 

1.4.1 Qualitative goals and main ideas 

What kind of bounds do we need? What is there in the literature? 

We wish to obtain upper bounds on |5'^(Q!,a7)| for some weight 77 and any a G R/Z 
not very close to a rational with small denominator. Every a is close to some rational 
a/q; what we are looking for is a bound on \Srj{a, a;)| that decreases rapidly when g 
increases. 

Moreover, we want our bound to decrease rapidly when 6 increases, where a = 
a/q + 6/x. In fact, the main terms in our bound will be decreasing functions of 
max(l, |(5|/8) • g. (Let us write Sq = max(2, |(5|/4) from now on.) This will allow 
our bound to be good enough outside narrow major arcs, which will get narrower and 
narrower as g increases - that is, precisely the kind of major arcs we were presupposing 
in our major-arc bounds. 
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It would be possible to work with narrow major arcs that become narrower as q 
increases simply by allowing q to be very large (close to x), and assigning each angle 
to the fraction closest to it. This is, in fact, the common procedure. However, this 
makes matters more difficult, in that we would have to minimize at the same time the 
factors in front of terms x/q, xjy^, etc., and those in front of terms q, y/qx, and so 
on. (These terms are being compared to the trivial bound x.) Instead, we choose to 
strive for a direct dependence on 6 throughout; this will allow us to cap g at a much 
lower level, thus making terms such as q and y/qx negligible. (This choice has been 
taken elsewhere in applications of the circle method, but, strangely, seems absent from 
previous work on the ternary Goldbach conjecture.) 

How good must our bounds be? Since the major-arc bounds are valid only for 
q < r = 300000 and |(5| < 4r/g, we cannot afford even a single factor of log a; (or 
any other function tending to oo as x —oo) in front of terms such as x/ ^/q\So\■ a 
factor like that would make the term larger than the trivial bound x if g|(5o| is equal to 
a constant (r, say) and x is very large. Apparently, there was no such “log-free bound” 
with explicit constants in the literature, even though such bounds were considered to 
be in principle feasible, and even though previous work ( IIChe85L IIDab96l , IIDROIL 
IITaol4l ) had gradually decreased the number of factors of log x. (In limited ranges for 
q, there were log-free bounds without explicit constants; see IIDab96l . BRamlOI . The 
estimate in IIVin541 Thm. 2a, 2b] was almost log-free, but not quite. There were also 
bounds llKar931 , OButlll that used L-functions, and thus were not really useful in a 
truly minor-arc regime.) 

It also seemed clear that a main bound proportional to (log g)^x /^ (as in ITaolTI l 
was too large. At the same time, it was not really necessary to reach a bound of the 
best possible form that could be found through Vinogradov’s basic approach, namely 


\Srj{a,x)\ < C 


Xy/q 


(1.13) 


Such a bound had been proven by Ramare BRamlOl for g in a limited range and C 
non-explicit; later, in IIRamcl - which postdates the hrst version of IIHelbll - Ramare 
broadened the range to g < x^/^® and gave an explicit value for C, namely, C = 13000. 
Such a bound is a notable achievement, but, unfortunately, it is not useful for our 
purposes. Rather, we will aim at a bound whose main term is bounded by a c onstan t 
around 1 times x(log Soq)/^/So4>{q)', this is slightly worse asymptotically than (1.131, 
but it is much better in the delicate range of Soq ~ 300000, and in fact for a much 
wider range as well. 


* * * 

We see that we have several tasks. One of them is the removal of logarithms: we 
cannot afford a single factor of log x, and, in practice, we can afford at most one factor 
of log g. Removing logarithms will be possible in part because of the use of previously 
existing efficient techniques (the large sieve for sequences with prime support) but also 
because we will be able to find cancellation at several places in sums coming from a 
combinatorial identity (namely, Vaughan’s identity). The task of finding cancellation 
is particularly delicate because we cannot afford large constants or, for that matter. 
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statements valid only for large x. (Bounding a sum such as efficiently, where 

fi is the Mobius function 


fi{n) 


(—1)^ if n = pip2 ■ ■ - Pk, all Pi distinct 
0 if p^\n for some prime p, 


is harder than estimating a sum such as equally efficiently, even though we 

are used to thinking of the two problems as equivalent.) 

We have said that our bounds will improve as |(5| increases. This dependence on 
6 will be secured in different ways at different places. Sometimes 6 will appear as 
an argument, as in r]{—6)', for p piecewise continuous with p' G Li, we know that 
|p(t)| —)• 0 as |f| —)■ c». Sometimes we will obtain a dependence on 5 by using several 
different rational approximations to the same a € M. Lastly, we will obtain a good 
dependence on 5 in bilinear sums by supplying a scattered input to a large sieve. 

If there is a main moral to the argument, it lies in the close relation between the 
circle method and the large sieve. The circle method rests on the estimation of an 
integral involving a Fourier transform / : M/Z —C; as we will later see, this leads 
naturally to estimating the .^ 2 -norm of / on subsets (namely, unions of arcs) of the circle 
K/Z. The large sieve can be seen as an approximate discrete version of Plancherel’s 
identity, which states that |/|2 = \f\ 2 - 

Both in this section and in §1.5| we shall use the large sieve in part so as to use 
the fact that some of the functions we work with have prime support, i.e., are non-zero 
only on prime numbers. There are ways to use prime support to improve the output 
of the large sieve. In 11.5 these techniques will be refined and then translated to the 
context of the circle method, where / has (essentially) prime support and |/p must be 
integrated over unions of arcs. (This allows us to remove a logarithm.) The main point 
is that the large sieve is not being used as a black box; rather, we can adapt ideas from 
(say) the large-sieve context and apply them to the circle method. 

Lastly, there are the benefits of a continuous p. Hardy and Littlewood already 
used a continuous p; this was abandoned by Vinogradov, presumably for the sake of 
simplicity. The idea that smooth weights rj can be superior to sharp truncations is 
now commonplace. As we shall see, using a continuous p is helpful in the minor-arcs 
regime, but not as crucial there as for the major arcs. We will not use a smooth p; we 
will prove our estimates for any continuous rj that is piecewise Ci, and then, towards 
the end, we will choose to use the same weight p = 772 as in IITaol4l . in part because it 
has compact support, and in part for the sake of comparison. The moral here is not quite 
the common dictum “always smooth”, but rather that different kinds of smoothing can 
be appropriate for different tasks; in the end, we will show how to coordinate different 
smoothing functions 77 . 

There are other ideas involved; for instance, some of Vinogradov’s lemmas are 
improved. Let us now go into some of the details. 


1.4.2 Combinatorial identities 

Generally, since Vinogradov, a treatment of the minor arcs starts with a combinatorial 
identity expressing A(n) (or the characteristic function of the primes) as a sum of two 
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or more convolutions. (In this section, by a convolution f*g, we will mean the Dirichlet 
convolution (/ * g){n) = J2d\n f{d)g{n/d), i.e., the multiplicative convolution on the 
semigroup of positive integers.) 

In some sense, the archetypical identity is 

A = p * log, 

but it will not usually do: the contribution of g,{d) log(n/d) with d close to n is too 
difficult to estimate precisely. There are alternatives: for example, there is the identity 

A(n) logn =/r * log^ — A * A, (1-14) 

which underlies an estimate of Selberg’s that, in turn, is the basis for the Erdos-Selberg 
proof of the prime number theorem; see, e.g., OMV07I §8.2]. More generally, one 
can decompose A(n)(logn)^ as /r * log^^^ minus a linear combination of convolu¬ 
tions; this kind of decomposition - really just a direct consequence of the develop¬ 
ment of (C^(s)/C('S))*^^^ - will be familiar to some from the exposition of Bombieri’s 
work IIBom76l in OFIIOI §3] (for instance). Another useful identity was that used by 
Daboussi ||Dab96l ; witness its application in BDROIL which gives explicit estimates on 
exponential sums over primes. 

The proof of Vinogradov’s three-prime result was simplihed substantially 0Vau77bl 
by the introduction of Vaughan’s identity: 

A(n) = g<u * log —A<y * fj,<u * 1 -I-1 * /i>j/ * A>y -I- A<y, (1-15) 

where we are using the notation 

//(’^) tfn< W, Jo if n < W, 

if n>W, ifn>W. 

Of the resulting sums * iog){n)e{an)ri{n/x), etc.), the hrst three are said 

to be of type /, type I (again) and type IT, the last sum, J2n<v negligible. 

One of the advantages of Vaughan’s identity is its flexibility: we can set U and V 
to whatever values we wish. Its main disadvantage is that it is not “log-free”, in that it 
seems to impose the loss of two factors of log x: if we sum each side of ( |1.15[ ) from 1 
to X, we obtain J2n<x ~ on the left side, whereas, if we bound the sum on the 
right side without the use of cancellation, we obtain a bound of a:(logx)^. Of course, 
we will obtain some cancellation from the phase e{an); still, even if this gives us a 
factor of, say, l/y4j, we will get a bound of x(loga;)^/y^, which is worse than the 
trivial bound x for q bounded and x large. Since we want a bound that is useful for all 
q larger than the constant r and all x larger than a constant, this will not do. 

As was pointed out in IITaol4l . it is possible to get a factor of (logq)^ instead of a 
factor of (logcc)^ in the type II sums by setting U and V appropriately. Unfortunately, 
a factor of (log is still too large in practice, and there is also the issue of factors of 
log X in type I sums. 

Vinogradov had already managed to get an essentially log-free result (by a rather 
difficult procedure) in 0Vin541 Ch. IX]. The result in ||Dab96l is log-free. Unfortu¬ 
nately, the explicit result in BDROll - the study of which encouraged me at the begin¬ 
ning of the project - is not. For a while, I worked with the case fc = 2 of the expansion 
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of (C'(s)/C(s))*'^\ which gives 

A • log^ = ^ * log^ —3 • (A • log) *A — A*A*A. 


(1.16) 


This identity is essentially log-free: while a trivial bound on the sum of the right side 
for n from 1 to iV does seem to have two extra factors of log, they are present only in 
the term ^ * log^, which is not the hardest one to estimate. Ramare obtained a log-free 
bound in BRamlOl using an identity introduced by Diamond and Steinig in the course 
of their own work on elementary proofs of the prime number theorem IIDS70II : that 
identity gives a decomposition for A • log^ that can also be derived from the expansion 
of {('(s )/by a clever grouping of terms. 

In the end, I decided to use Vaughan’s identity, motivated in part by 0Taol4l . and 
in part by the lack of free parameters in (1.16 1 ; as can be seen in ( |1.15| l, Vaughan’s 
identity has two parameters U, V that we can set to whatever values we think best. The 
form of the identity allowed me to reuse much of my work up to that point, but it also 
posed a challenge: since Vaughan’s identity is by no means log-free, one has obtain 
cancellation in Vaughan’s identity at every possible step, beyond the cancellation given 
by the phase e{an). (The presence of a phase, in fact, makes the task of getting can¬ 
cellation from the identity more complicated.) The removal of logarithms will be one 
of our main tasks in what follows. It is clear that the presence of the Mobius function 
/r should give, in principle, some cancellation; we will show how to use it to obtain as 
much cancellation as we need - with good constants, and not just asymptotically. 


1.4.3 Type I sums 

There are two type I sums, namely, 

/r(?7i.) y^(logn)e(amn)77 ( - J (1-17) 

m<U n 

and 

A(u) /i(u) e{avun)ri _ (1.18) 

v<V u<U n 

In either case, a = ajq + 6/x, where q is larger than a constant r and |i5/a:| < l/qQo 
for some Qq > max(( 7 , y^). For the purposes of this exposition, we will set it as our 
task to estimate the slightly simpler sum 

e{amn)ri (1-19) 

m<D n 


where D can be U or UV or something else less than x. 

Why can we consider this simpler sum without omitting anything essential? It is 
clear that ( 1.17 1 is of the same kind as (|1.19 1. The inner double sum in ( 1.18 1 is just 
(1.19 1 with av instead of a; this enables us to estimate ( 1.18| l by means of ( 1.19 1 for q 
small, i.e., the more delicate case. If q is not small, then the approximation av ~ av/q 
may not be accurate enough. In that case, we collapse the two outer sums in ( 1.18| l into 
a sum J2ni-^<v * T<u)(n), and treat all of (1.18 1 much as we will treat (1.19 1 ; since 



























1.4. THE MINOR ARCS dJl 


19 


q is not small, we can afford to bound (A<y * ij,<u){n) trivially (by logn) in the less 
sensitive terms. 

Let us first outline Vinogradov’s procedure for bounding type I sums. Just by sum¬ 
ming a geometric series, we get 


^ e{an) 

n<N 


< min 



( 1 . 20 ) 


where c is a constant and {a} is the distance from a to the nearest integer. Vinogradov 
splits the outer sum in ( |1.19| l into sums of length q. When m runs on an interval of 
length q, the angle am/q runs through all fractions of the form b/q-, due to the error 
S/x, am could be close to 0 for two values of n, but otherwise {am} takes values 
bounded below by 1/q (twice), 2/q (twice), 3/g (twice), etc. Thus 


E 

/i(m) e{amn) 

^ E 

e{amn) 

y<m<y+q 

n<N 

y<m<y+q 

n<N 


< 


2N 

m 


+ 2cq log eq 


( 1 . 21 ) 


for any y >0. 

There are several ways to improve this. One is simply to estimate the inner sum 
more precisely; this was already done in IIDROlll . One can also define a smoothing 
function 77 , as in (|1.19|l; it is easy to get 


e(an)?? 

n<N 



< min 


(^x\r]\i + 



W\l \v"\oo \ 

2| sin( 7 ra)| ’ 4 x(sin 7 ra)^ j 


Except for the third term, this is as in flTaol41 . We could also choose carefully which 
bound to use for each m; surprisingly, this gives an improvement - in fact, an impor¬ 
tant one, for m large. However, even with these improvements, we still have a term 


proportional to N/m as in (1.21 1 , and this contributes about {x\ogx)/q to the sum 


( 1. 19| l, thus giving us an estimate that is not log-free. 

What we have to do, naturally, is to take out the terms with q|m for m small. (If m 
is large, then those may not be the terms for which ma is close to 0 ; we will later see 
what to do.) For y + q < Q/2, \a — a/q\ < IjqQ, we get that 


E ^ 

y<m<y+q 


B 


sm Tran 


C 


sm Tran ■ 


( 1 . 22 ) 


IS at most 


20 

Stt^ 


Cq^, 2A + —Vac, — max f 2, log 

TT TT V BtT J 


(1.23) 


This is satisfactory. We are left with all the terms m < M = min(U, Q/2) with q\m 
- and also with all the terms Q/2 < m < D. For m < M divisible by q, we can 
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estimate (as opposed to just bound from above) the inner sum in ( 1.19| l by the Poisson 
summation formula, and then sum over m, but without taking absolute values; writing 
m = aq, we get a main term 


q a 

^ a<Mlq 

{a,q) = l 


(1.24) 


where (a, q) stands for the greatest common divisor of a and q. 

It is clear that we have to get cancellation over ^ here. There is an elegant elemen¬ 
tary argument IIGR96I showing that the absolute value of the sum in ( |1.24| i is at most 
1. We need to gain one more log, however. Ramare BRambl helpfully furnished the 
following bound; 


E 

a<.x 


T{a) 

a 


(a,q) = l 


4 q 1 

5 \ogx/q 


(1.25) 


for q < X. (Cf. BEM95II . IIEM96I 1 This is neither trivial nor elementary]^ We are, so to 
speak, allowed to use non-elementary means (that is, methods based on L-functions) 
because the only L-function we need to use here is the Riemann zeta function. 

What shall we do for m > Q/21 We can always give a bound 


y<m<y+q 


c 


sin Tran ■ 




(1.26) 


for y arbitrary; since AC will be of constant size, {Aq/Ti)^/AC is pleasant enough, but 
the contribution of 3^ ^ 3|77|iai/2/ is nasty (it adds a multiple of {x\ogx)/q to the 
total) and seems unavoidable: the values of m for which am is close to 0 no longer 
correspond to the congruence class m = 0 mod q, and thus cannot be taken out. 

The solution is to switch approximations. (The idea of using different approxima¬ 
tions to the same a is neither new nor recent in the general context of the circle method; 
see IIVau971 §2.8, Ex. 2]. What may be new is its use to clear a hurdle in type I sums.) 
What does this mean? If a were exactly, or almost exactly, a/q, then there would be 
no other very good approximations in a reasonable range. However, note that we can 
define Q = for a = ajq + Sjx, and still have \a — a/q\ < l/qQ- If 5 is very 

small, Q will be larger than 2D, and there will be no terms with Q/2 < m < D to 
worry about. 

^The current state of knowledge may seem surprising: after all, we expect nearly square-root cancella¬ 
tion - for instance, | Tr(n)/n| < \f2jx holds for all real 0 < a; < 10^^; see also the stronger 

bound IDre93l i. The classical zero-free region of the Riemann zeta function ought to give a factor of 
exp(—^ (log x)lc), which looks much better than 1/log a;. What happens is that (a) such a factor is 
not actually much better than 1/ log a: for x ~ 10^®, say; (b) estimating sums involving the Mdhius func¬ 
tion by means of an explicit formula is harder than estimating sums involving A(n): the residues of 1/C('5) 
at the non-trivial zeros of s come into play. As a result, getting non-trivial explicit results on sums of /i(n) 
is harder than one would naively expect from the quality of classical effective (but non-explicit) results. See 
IRamal for a survey of explicit bounds. 
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What happens if S is not very small? We know that, for any Q', there is an approx¬ 
imation a'/g' to a with \a — a'/q'\ < Ijg'Q' and q' < Q'. However, for Q' > Q, we 
know that a'/g' cannot equal a/g: by the definition of Q, the approximation ajq is not 
good enough, i.e., \a — a/q\ < IjqQ' does not hold. Since a/g ^ a!/q', we see that 
|a/g — a'/g'l > 1/gg', and this implies that q' > (e/(1 + e))Q- 

Thus, for TO > (5/2, the solution is to apply ( 1.26| l with a'/g' instead of a/g. The 
contribution of A fades into insignificance: for the first sum over a range y < m < 
y + q', y > <5/2, it contributes at most xj{Qj2), and all the other contributions of A 
sum up to at most a constant times {x log x) /g'. 

Proceeding in this way, we obtain a total bound for (1.19i whose main terms are 
proportional to 


1,^^, ^\/W'\oo ■ D and glogmax g^ , (1.27) 

with good, explicit constants. The first term - usually the largest one - is precisely what 
we needed: it is proportional to {l/(f>{q))x/ logx for g small, and decreases rapidly as 
I(5 1 increases. 


</(<?) logf 


1.4.4 Type II, or bilinear, sums 

We must now bound 

5 = E(i * T>u){'m) E A{n)e{amn)ri{mn / x). 

m n>V 


At this point it is convenient to assume that rj is the Mellin convolution of two functions. 
The multiplicative or Mellin convolution on IR+ is defined by 


ivo *M 



dr 

r 


Tao IITaol4l takes rj = ri2 = h *m where rji is a brutal truncation, viz., the 
function taking the value 2 on [1/2,1] and 0 elsewhere. We take the same 772 , in part 
for comparison purposes, and in part because this will allow us to use off-the-shelf 
estimates on the large sieve. (Brutal truncations are rarely optimal in principle, but, as 
they are very common, results for them have been carefully optimized in the literature.) 
Clearly 


S = 


fX/u 

( \ 

L ^ 

E 

m 

d>U 

\ (i|m / 


E (^) • E Mn)e(amn)yi (^) (1.28) 

m d>U ^ ' n>V 
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By Cauchy-Schwarz, the integrand is at most y/Si{U, W)S 2 {V, W), where 


Si{U,W) 


S2{V,W) 


E 


E 


E 


d>U 

d\m 


A{n)e{amn) 

max ^ y, ) < n < ty 


(1.29) 


We must bound Si{U, W) by a constant times xjW. We are able to do this - with 
a good constant. (A careless bound would have given a multiple of {x/U) \og^{x/U), 
which is much too large.) First, we reduce S'i(VF) to an expression involving an inte¬ 
gral of 


EE 

r\<x r2"^x 


o-(ri)cr(r2) ■ 


(ri.r-2) = l 


(1.30) 


We can bound (1.30 1 by the use of bounds on J2n<t combined with the es¬ 

timation of infinite products by means of approximations to t^{s) for s —1^. After 
some additional manipulations, we obtain a bound for Si{U, W) whose main term is 
at most (3/7r^)(a;/hF) for each W, and closer to 0.22482a;/VF on average over W. 

(This is as good a point as any to say that, throughout, we can use a trick in IITaol4l 
that allows us to work with odd values of integer variables throughout, instead of letting 
m or n range over all integers. Here, for instance, if m and n are restricted to be odd, 
we obtain a bound of (2/7r^)(a;/kF) for individual W, and 0.15107a;/kF on average 
over W. This is so even though we are losing some cancellation in /i by the restriction.) 

Let us now bound S 2 {V,W). This is traditionally done by Linnik’s dispersion 
method. However, it should be clear that the thing to do nowadays is to use a large 
sieve, and, more specifically, a large sieve for primes; that kind of large sieve is nothing 
other than a tool for estimating expressions such as S 2 {V,W). (Incidentally, even 
though we are trying to save every factor of log we can, we choose not to use small 
sieves at all, either here or elsewhere.) In order to take advantage of prime support, we 
use Montgomery’s inequality ( 0Mon68l . IIHux72l : see the expositions in 0Mon71l pp. 
27-29] and IIIK04I §7.4]) combined with Montgomery and Vaughan’s large sieve with 
weights 0MV73I (1.6)], following the general procedure in 0MV73I (1.6)]. We obtain a 
bound of the form 


log VF / X qW\W 

log|| V#(9) </'(?)/ 2 


(1.31) 


on S' 2 (L, W), where, of course, we can also choose not to gain a factor of \ogW/2q if 
q is close to or greater than W. 

It remains to see how to gain a factor of |(5| in the major arcs, and more specihcally 
in 5*2 (L, W). To explain this, let us step back and take a look at what the large sieve is. 
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Given a civilized function / : Z —C, Plancherel’s identity tells us that 



2 

da 


Ei/wi'- 


The large sieve can be seen as an approximate, or statistical, version of this; for a 
“sample” of points ai,a 2 ,. ■ ■ ,ak satisfying \ai — aj\ > /3 for i ^ j, it tells us that 




<(X + /3-1)EI/(^)P, 

n 


(1.32) 


assuming that / is supported on an interval of length X. 

Now consider ai = a,a 2 = 2a, 03 = 3a.... If a = a/q, then the angles 



want to do better for |(5| larger than a constant. 

Suppose, then, that a — a/q + 5/x, where |5| > 8, say. Then the angles ai 
and a,j+i are not identical: |ai — a^+ij < q\5\/x. We also see that a^+i is at a 
distance at least q\5\/x from a 2 , 03 ,... a^, provided that q\5\/x < 1/q. We can go 
on with aq+ 2 ^ ctq+s ,..., and stop only once there is overlap, i.e., only once we reach 
am such that m\S\/x > 1/q. We then give all the angles ai,... ,am - which are 
separated by at least q\5\/x from each other - to the large sieve at the same time. We 
do this \L/rn\ < \L/{x/\5\q)\ times, and obtain a total bound of |’L/(a;/|(5|g)] (2f + 
®/l^k) Sn l/(^)P’ which, for L = x/2W, X = IT/2, gives us about 


/ a; IT x\ 

^ 4) 


log IT 


provided that L > x/\5\qwd, as usual, |a—a/q| < 1/qQ. This is very small compared 
to the trivial bound < xW/S. 

What happens if L < x/\5q\l Then there is never any overlap: we consider all 
angles ai, and give them all together to the large sieve. The total bound is (IT^/4 + 
a;IT/ 2 | 5 |( 7 ) log IT. If L = x/2W is smaller than, say, x/^Sq], then we see clearly 
that there are non-intersecting swarms of angles a^ around the rationals a/q. We can 
thus save a factor of log (or rather {(j){q)/q) log(IT/|i5g|)) by applying Montgomery’s 
inequality, which operates by strewing displacements of given angles (or, here, swarms 
around angles) around the circle to the extent possible while keeping everything well- 
separated. In this way, we obtain a bound of the form 


log IT / X g IT\ IT 

'/(9) 2 y 2 ■ 


Compare this to (1.31 1 ; we have gained a factor of \5\/A, and so we use this estimate 
when I (51 > 4. (We will actually use the criterion |(5| > 8 , but, since we will be working 
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with approximations of the form 2a = ajq + 6/x, the value of S in our actual work 
is twice of what it is in this introduction. This is a consequence of working with sums 
over the odd integers, as in IITao 141 .1 

* * * 


We have succeeded in eliminating all factors of log we came across. The only 
factor of log that remains is logx/UV, coming from the integral dWjW. Thus, 
we want UV to be close to x, but we cannot let it be too close, since we also have a 
term proportional to D = UV in ( |1.27| i, and we need to keep it substantially smaller 
than X. We set U and V so that UV is x/ yjqmax.{A, |5|) or thereabouts. 

In the end, after some work, we obtain our main minor-arcs bound (Theorem |3.1.1| l. 
It states the following. Let x > xq, xq = 2.16 • 10^°. Tecall that Sri{a,x) = 
A(n)e(an)77(n/x) andp 2 = ??i*m?7i = 4-l[i/ 2 ,i] * l[i/ 2 ,i]- Let 2a = ajq + 6lx, 
Q < Q, gcd(a, q) = 1, \5/x\ < I/qQ, where Q = (3/4)a;^/^. If g < then 


IC, , i?a:.5oglog(5oq-f 0.5 

\Srj{a,x)\ < -- 

V oo(p{q) 


X ■ 


y%q Ooq 


n.331 




6o = max(2, |(5|/4), 


= 0.27125 log 



log4f \ 

2 log 9x1/3 I 
^ 2.004t / 


-f 0.41415, 


d^x,t,q — 


</'(?) 


^ logf -f 7.82 


-f 13.66 log f-f 37.55. 


(1.34) 


The factor is small in practice; for typical “difficult” values of x and 5qx, it is 
less than 1. The crucial things to notice in ( |L33| l are that there is no factor of log x, and 
that, in the main term, there is only one factor of log d^q. The fact that Jg helps us as 
it grows is precisely what enables us to take major arcs that get narrower and narrower 
as q grows. 


1.5 Integrals over the major and minor arcs 


So far, we have sketched (j |L3| l how to estimate 5”^(a, x) for a in the major arcs and 
r] based on the Gaussian and also (i 1.4 1 how to bound \Sri{a, x) \ for a in the 


minor arcs and q = q 2 , where 772 = 4 • l[i/ 2 ,i] *m 1 [ i / 2 , i ]- We now must show how to 
use such information to estimate integrals such as the ones in (|L4|l. 


We will use two smoothing functions p+, p*; in the notation of (1.3 1 , we set /i = 


/2 = A(n)? 7 +(n/x), /g = A{n)q^{n/x), and so we must give a lower bound for 


/ {Srj^{a,x))'^Srj,{a,x)e{—an)da 

Jw 


(1.35) 


and an upper bound for 


/ (a, x) 1 ^ Srj, (a, x)e{—an)da 

J m 


(1.36) 
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so that we can verify ( |1.4| l. 

The traditional approach to (|1.36[) is to bound 


/ {^7]+ (cKj {a, x)e{—an)da < / (a, x) I ^ da • max ffl (a) 

Jm Jm aGm 

< A(n)^pi f—) ■ max Sr,, {a, x). 

^' \X/ aem 


(1.37) 


Since the sum over n is of the order of x log x, this is not log-free, and so cannot be 
good enough; we will later see how to do better. Still, this gets the main shape right; 
our bound on (1.36 1 will be proportional to Ip+lfljy*!!. Moreover, we see that 77 * has 
to be such that we know how to bound \Sri, (a, x) | for a G m, while our choice of rj^ 
is more or less free, at least as far as the minor arcs are concerned. 

What about the major arcs? In order to do anything on them, we will have to be 
able to estimate both ? 7 +(a) and 77 * (a) for a G 9Jt. If that is the case, then, as we 
shall see, we will be able to obtain that the main term of (1.35 1 is an infinite product 
(independent of the smoothing functions), times x^, times 


(r/+(—a))^T/l(—a)e(—an/x)da 


r/+(ti)7f+(t2)v* - (^1 +^ 2 )) dtidt2 


(1.38) 


In other words, we want to maximize (or nearly maximize) the expression on the right 
of ( 1.38 1 divided by | 77 +||| 77 *|i. 

One way to do this is to let 77 * be concentrated on a small interval [0, e). Then the 
right side of (1.38 1 is approximately 




dt. 


(1.39) 


To maximize (1.39 1 , we should make sure that 77 +(f) ^ ri+{n/x — t). We set x ~ 77 / 2 , 
and see that we should define 77 + so that it is suppo rted o n [ 0 , 2 ] and symmetric around 
f = 1, or nearly so; this will maximize the ratio of (1.39 1 to | 7 ?+| 2 l? 7 *|i. 

We should do this while making sure that we will know how to estimate (a, x) 
for a G (H. We know how to estimate Srj(a,x) very precisely for functions of the 
form 77 (f) = 77 (f) = p(f)fe“‘^/^, etc., where g{t) is band-limited. We will 

work with a function 77 + of that form, chosen so as to be very close (in £2 norm) to a 
function 770 that is in fact supported on [ 0 , 2 ] and symmetric around f = 1 . 

We choose 

"fy2-f)3e-(*-i)'/2 iffG[0,2], 

^0 iff ^[ 0 , 2 ]. 

This function is obviously symmetric (770 (f) = 770(2 — f)) and vanishes to high order 
at f = 0 , besides being supported on [ 0 , 2 ]. 

We set 77_|_(f) = where hfi(t) is an approximation to the function 


do{t) = 


h{t) = 






iff G [0,2] 
iff ^ [ 0 , 2 ]. 
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We just let hji{t) be the inverse Mellin transform of the truncation of Mh to an interval 
[—iR,iR]. (Explicitly, 


hR{t)= f h{ty — , 

Jo y 

where Fn{t) = sm{R log y)/( 7 r log y), that is, is the Dirichlet kernel with a change 
of variables.) 

Since the Mellin transform of is regular at s = 0 , the Mellin transform 

M 774 . will be holomorphic in a neighborhood of {s : 0 < 3fi(s) < 1 }, even though 
the truncation of Mh to [—iR, iR] is brutal. Set R = 200 , say. By the fast decay of 
Mh{it) and the fact that the Mellin transform M is an isometry, | {hR{t) — h{t))/t \2 is 
very small, and hence so is [ 77 + — P 0 I 2 , as we desired. 

But what about the requirement that we be able to estimate (a, x) for both 
a G m and a G 

Generally speaking, if we know how to estimate Srj^ {a, x) for some a G M/Z and 
we also know how to estimate Sri^{a,x) for all other a G K/Z, where pi and 772 are 
two smoothing functions, then we know how to estimate Srj^{a, x) for all a G M/Z, 
where 773 = 771 V2, or, more generally, 77* (f) = (771 *m 772)(^f), k > 0 a constant. 

This is an easy exercise on exchanging the order of integration and summation: 

Sn, (a, x) = ^ A{n)e{an){r]i *m m) 

^ A(77)e(an)77i(Kr)772 ( —) — = f r]i{Kr)Sn2{rx) — , 

\rx/ r In r 

n ^ 

(1.40) 

and similarly with 771 and 772 switched. Of course, this trick is valid for all exponential 
sums: any function f{n) would do in place of A(7t,). The only caveat is that 771 (and 
772) should be small very near 0, since, for r small, we may not be able to estimate 
Srj 2 (tx) (or Sri^ (rx)) with any precision. This is not a problem; one of our functions 
will be which vanishes to second order at 0, and the other one will be 772 = 

4 ■ l[i/2.i] *M l[i/2,r]^ which has support bounded away from 0 . We will set k large 
(say K = 49 ) so that the support of 77* is indeed concentrated on a small interval [ 0 , e), 
as we wanted. 



Now that we have chosen our smoothing weights 77 + and 77 *, we have to estimate the 
major-arc integral (1.35 1 and the minor-arc integral ( 1.36| l. What follows can actually 
be done for general 77 + and 77 *; we could have left our particular choice of 77 + and 77 * 
for the end. 


Estimating the major-arc integral (1.351 may sound like an easy task, since we have 


rather precise estimates for Sr,{a, x) (77 = when a is on the major arcs; we 


could just replace x) in (1.35 1 by the approximation given by o and ( | 1.1 l| l. It 


IS, however, more efficient to express ( |1.35| l as the sum of the contributio n of th e trivial 
character (a sum of integrals of {r}{—5)x)'^, where rj[—5)x comes from (|l.ll|l), plus a 
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term of the form 


(maximum of • i7(g) for 9 < r) • / \Sr]^{a,x)\'^ da, 

Jm 


where E{q) = is as in (l.l2i, plus two other terms of essentially the same form. As 


usual, the major arcs DJI are the arcs around rationals a/q with q < r. We will soon 
discuss how to bound the integral of {a, x) | over arcs around rationals a/q with 
q < s, s arbitrary. Here, however, it is best to estimate the integral over using the 
estimate on {a, x) from (1.7 1 and (jl.ll|; we obtain a great deal of cancellation, 
with the effect that, for x non-trivial, the error term in (1.12 1 appears only when it gets 
squared, and thus becomes negligible. 

The contribution of the trivial character has an easy approximation, thanks to the 
fast decay of ffo- We obtain that the major-arc integral (1.351 equals a main term 


CoCn,,,n,x^, where 


c„=n(i- 


p\n 


n 1 


^oo /*oo 


a 


Vo,V* 


(p-i)V yv (p-1)^ 

pfn 

'no{ti)Tio{t2)'n* - (^1 + dtidt2 


plus several small error terms. We have already chosen rjo, 77 * and x so as to (nearly) 
maximize C, 


rio.Vt- 


It is time to bound the minor-arc integral (1.36 1 . As we said i n §1.5| we must do 
better than the usual bound (1.37i. Since our minor-arc bound (3.2i on |S'^(a,a;)|, 
a ~ a/q, decreases as q increases, it makes sense to use partial summation together 
with bounds on 


/ \Sr}+{a,x)\‘^ = / \Sr}+{a,x)\^da- / \Srj+{a,x)\‘^da, 

Jms dms dm 


where ntg denotes the arcs around a/q, r < q < s, and SJlg denotes the arcs around all 
cl/t Q s. We already know how to estimate the integral on DJI. How do we bound 
the integral on DJlg ? 

In order to do better than the trivial bound we will need to use the 


fact that the series ( 1.61 dehning (a, x) is essentially supported on prime numbers. 
Bounding the integral on is closely related to the problem of bounding 


2 


^ ^ ^ ane{a/q) 

q<s a mod g n<x 
(a,q) = l 


(1.41) 


efficiently for s considerably smaller than ^/x and a„ supported on the primes ^/x < 
p < X. This is a classical problem in the study of the large sieve. The usual bound on 
( 1.41| i (by, for instance, Montgomery’s inequality) has a gain of a factor of 

2e'^ (logs)/(log x/s^) 
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relative to the bound of (x + s^) '^^at one would get from the large sieve 

without using prime support. Heath-Brown proceeded similarly to bound 



(a, x)\^da < 


2e^ log s 
log x/s"^ 


\Srj_f_{a, x)\'^da. 


(1.42) 


This already gives us the gain of C'(logs)/loga: that we absolutely need, but 
the constant C is suboptimal; the factor in the right side of ( 1 .42| l should really be 
(log s )/ log X, i.e., C should be 1. We cannot reasonably hope to obtain a factor better 
than 2 (log s)/log a; in the minor arcs due to what is known as the parity problem in 
sieve theory. As it turns out, Ramare IIRam09ll had given general bounds on the large 
sieve that were clearly conducive to better bounds on ( |1.41| l, though they involved a 
ratio that was not easy to bound in general. 

I used several careful estimations (including IIRam95l Lem. 3.4]) to reduce the 
problem of bounding this ratio to a hnite number of cases, which I then checked by 
a rigorous computation. This approach gave a bound on ( 1.41| i with a factor of size 
close to 2(logs)/loga;. (This solves the large-sieve problem for s < it would 
still be worthwhile to give a computation-free proof for all s < x^l'^~'^, e > 0.) It was 
then easy to give an analogous bound for the integral over SUlg, namely. 




\Srf_^ {a, x)\^da < 


2 logs 
logx 


\Srj_^^{a,x)f da, 


where < can easily be made precise by replacing logs by logs -I- 1.36 and logx by 
log X + c, where c is a small constant. Without this improvement, the main theorem 
would still have been proved, but the required computation time would have been mul¬ 
tiplied by a factor of considerably more than e^'*' = 5.6499 .... 

What r emain ed then was just to compare the estimates on ( |1.35| l and ( |1.36| l and 
check that (1.36 1 is smaller for n > 10^^. This hnal step was just bookkeeping. As 
we already discussed, a check for n < 10^^ is easy. Thus ends the proof of the main 
theorem. 


1.6 Some remarks on computations 

There were two main computational tasks: verifying the ternary conjecture for all n < 
C, and checking the Generalized Riemann Hypothesis for modulus q < r up to a 
certain height. 

The first task was not very demanding. Platt and I verified in IIHP13II that every 
odd integer 5 < n < 8.8 • lO^o can be written as the sum of three primes. (In the 
end, only a check for 5 < n < 10^^ was needed.) We proceeded as follows. In a 
major computational effort, Oliveira e Silva, Herzog and Pardi BOeSHPMI ) had already 
checked that the binary Goldbach conjecture is true up to 4 • 10^® - that is, every even 
number up to 4 • 10^® is the sum of two primes. Given that, all we had to do was 
to construct a “prime ladder”, that is, a list of primes from 3 up to 8.8 • 10®° such 
that the difference between any two consecutive primes in the list is at least 4 and at 
most 4 • 10^®. (This is a known strategy: see 0Sao98ll .l Then, for any odd integer 
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5 < n < 8.8 • 10^°, there is a prime p in the list such that 4 < n — p < A ■ 10^® + 2. 
(Choose the largest p < nin the ladder, or, if n minus that prime is 2, choose the prime 
immediately under that.) By llOeSHP14ll (and the fact that 4 • 10^® + 2 equals p + q, 
where p = 2000000000000001301 and q = 1999999999999998701 are both prime), 
we can write n — p = pi + P 2 for some primes pi, p 2 , and so n = p + pi + P 2 . 

Building a prime ladder involves only integer arithmetic, that is, computer manip¬ 
ulation of integers, rather than of real numbers. Integers are something that computers 
can handle rapidly and reliably. We look for primes for our ladder only among a spe¬ 
cial set of integers whose primality can be tested deterministically quite quickly (Broth 
numbers; k • 2"^ + 1, k < 2™). Thus, we can build a prime ladder by a rigorous, 
deterministic algorithm that can be (and was) parallelized trivially. 

The second computation is more demanding. It consists in verifying that, for every 
L-function L{s,x) with x of conductor q < r — 300000 (for q even) or q < r /2 
(for q odd), all zeroes of L{s,x) such that |3(s)| < Hq = 10 ®/q (for q odd) and 
|Q'(s)| ^ Hq = max(10®/q, 200 + 7.5 • lO'^/q (for q even) lie on the critical line. 
As a matter of fact, Platt went up to conductor q < 200000 (or twice that for q even) 
IPlabl : he had already gone up to conductor 100000 in his PhD thesis OPlal 11 . The 
verification took, in total, about 400000 core-hours (i.e., the total number of processor 
cores used times the number of hours they ran equals 400000; nowadays, a top-of-the- 
line processor typically has eight cores). In the end, since I used only q < 150000 (or 
twice that for q even), the number of hours actually needed was closer to 160000; since 
I could have made do with q < 120000 (at the cost of increasing C to 10^® or 10®°), it 
is likely, in retrospect, that only about 80000 core-hours were needed. 

Checking zeros of L-functions computationally goes back to Riemann (who did 
it by hand for the special case of the Riemann zeta function). It is also one of the 
things that were tried on digital computers in their early days (by Turing IITur53L for 
instance; see the exposition in HBooOhbll ). One of the main issues to be careful about 
arises whenever one manipulates real numbers via a computer: generally speaking, a 
computer cannot store an irrational number; moreover, while a computer can handle 
rationals, it is really most comfortable handling just those rationals whose denomina¬ 
tors are powers of two. Thus, one cannot really say; “computer, give me the sine of 
that number” and expect a precise result. What one should do, if one really wants to 
prove something (as is the case here!), is to say: “computer, I am giving you an interval 
I = [a/2^,b/2^]-, give me an interval I' = [c/2^,d/2\ preferably very short, such 
that sin(/) C This is called interval arithmetic; it is arguably the easiest way to do 
floating-point computations rigorously. 

Processors do not do this natively, and if interval arithmetic is implemented purely 
on software, computations can be slowed down by a factor of about 100. Fortunately, 
there are ways of running interval-arithmetic computations partly on hardware, partly 
on software. 

Incidentally, there are some basic functions (such as sin) that should always be done 
on software, not just if one wants to use interval arithmetic, but even if one just wants 
reasonably precise results; the implementation of transcendental functions in some of 
the most popular processors does not always round correctly, and errors can accumulate 
quickly. Fortunately, this problem is already well-known, and there is software that 
takes care of this. (Platt and I used the crlibm library ||DLDDD~*~10| .) 
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Lastly, there were several relatively minor computations strewn here and there in 
the proof. There is some numerical integration, done rigorously; once or twice, this 
was done using a standard package based on interval arithmetic llNed06l . but most of 
the time 1 wrote my own routines in C (using Platt’s interval arithmetic package) for 
the sake of speed. Another kind of computation (employed much more in OHelal than 
in the somewhat more polished version of the proof given here) was a rigorous version 
of a “proof by graph” (“the maximum of a function / is clearly less than 4 because I 
can see it on the screen”). There is a standard way to do this (see, e.g., IITCHTT] §5.2]); 
essentially, the bisection method combines naturally with interval arithmetic, as we 
shall describe in |2.6| Yet another computation (and not a very small one) was that 
involved in verifying a large-sieve inequality in an intermediate range (as we discussed 


in §1.5| l. 

It may be interesting to note that one of the inequalities used to estimate ( 1.30| l was 
proven with the help of automatic quantifier elimination BHBl 11 . Proving this inequal¬ 
ity was a very minor task, both computationally and mathematically; in all likelihood, 
it is feasible to give a human-generated proof. Still, it is nice to know from hrst- 
hand experience that computers can nowadays (pretend to) do something other than 
just perform numerical computations - and that this is already applicable in current 
mathematical practice. 













Chapter 2 

Notation and preliminaries 


2.1 General notation 


Given positive integers m, n, we say m\n°° if every prime dividing m also divides n. 
We say a positive integer n is square-full if, for every prime p dividing n, the square 
p^ also divides n. (In particular, 1 is square-full.) We say n is square-free if p^ \ n 
for every prime p. For p prime, n a non-zero integer, we define Vp{n) to be the largest 
non-negative integer a such that p“|n. 

When we write we mean unless the contrary is stated. As always, 

A(n) denotes the von Mangoldt function: 


A(n) 


logp if n = p“ for some prime p and some integer a > 1, 
0 otherwise. 


and p denotes the Mobius function: 


p{n) 


(—1)^ if n = pip2 ■ ■ - Pk, all Pi distinct 
0 if I n for some prime p. 


We let T(n) be the number of divisors of an integer n, uj{n) the number of prime 
divisors of n, and a{n) the sum of the divisors of n. 

We write (a, b) for the greatest common divisor of a and b. If there is any risk 
of confusion with the pair (a, b), we write gcd(a, 6 ). Denote by (a, b°°) the divisor 
of a. (Thus, a/(a, b°°) is coprime to b, and is in fact the maximal divisor 
of a with this property.) 

As is customary, we write e(a;) for We denote the Lr norm of a function / 

by \ f\r- We write 0*{R) to mean a quantity at most R in absolute value. Given a set 
S, we write I 5 for its characteristic function; 


ls(a;) 


1 if a; G S', 

0 otherwise. 


Write log"'" x for max(log x, 0). 
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2.2 Dirichlet characters and L functions 

Let us go over some basic terms. A Dirichlet character x : Z —)■ C of modulus g is a 
character x of i^/qZ)* lifted to Z with the convention that x('fi) = 0 when (n, q) ^ 1. 
(In other words: x is completely multiplicative and periodic modulo g, and vanishes 
on integers not coprime to q.) Again by convention, there is a Dirichlet character of 
modulus <7 = 1, namely, the trivial character xt : ^ C defined by XTin) = 1 for 
every n G Z. 

If X is a character modulo q and x' is a character modulo q'\q such that x(rt) = 
x'in) for all n coprime to q, we say that x' induces x- A character is primitive if it is 
not induced by any character of smaller modulus. Given a character x, we write x* for 
the (uniquely defined) primitive character inducing x- If a character x mod q is induced 
by the trivial character xt^ we say that x is principal and write xo for X (provided the 
modulus q is clear from the context). In other words, Xo(ri) = 1 when (n, g) = 1 and 
Xo{n) = 0 when (n, q) = 0. 

A Dirichlet L-function L{s, x) (x a Dirichlet character) is defined as the analytic 
continuation of to the entire complex plane; there is a pole at s = 1 if x 

is principal. 

A non-trivial zero of L{s, x) is any s G C such that L{s, x) = 0 and 0 < 5ft(s) < 1. 
(In particular, a zero at s = 0 is called “trivial”, even though its contribution can be 
a little tricky to work out. The same would go for the other zeros with 3?(s) = 0 
occuring for x non-primitive, though we will avoid this issue by working mainly with 
X primitive.) The zeros that occur at (some) negative integers are called trivial zeros. 

The critical line is the line 5ft(s) = 1/2 in the complex plane. Thus, the generalized 
Riemann hypothesis for Dirichlet L-functions reads: for every Dirichlet character x, 
all non-trivial zeros of L{s,x) lie on the critical line. Verifiable finite versions of 
the generalized Riemann hypothesis generally read: for every Dirichlet character x of 
modulus q < Q, all non-trivial zeros of L{s, x) with |9(s)| < f{q) lie on the critical 
line (where / : Z —)■ IR+ is some given function). 
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Thus, for instance, if / is compactly supported, continuous and piecewise then / 
decays at least quadratically. 

It could happen that = oo, in which case (2.1 1 is trivial (but not false). In 

practice, we require G Li. In a typical situation, / is differentiable k times except 
at a:i, X 2 ,..., Xk, where it is differentiable only {k — 2) times; the contribution of Xi 
(say) to is then | lim^_^^+ f^'‘~^\x) - lim^_^^- f^'^-'^\x)\. 

The following bound is standard (see, e.g., 0Taol4l Lemma 3.1]); for a G K/Z and 
/ : K —> C compactly supported and piecewise continuous. 


f{n)eian) 




/ • ( 1^1 , 5 I/I 1 

< mm / 1 + - / 1 , -r-— -^ 

' 2 |sm(7ra)| 


( 2 . 2 ) 


(The first bound follows from I/(’^)I — l/li + which, in turn is 

a quick consequence of the fundamental theorem of calculus; the second bound is 
proven by summation by parts.) The alternative bound (l/4)|/"|i/| sin(7ra)p given 
in IITaol4l Lemma 3.1] (for / continuous and piecewise C^) can usually be improved 
by the following estimate. 


Lemma 2.3.1. Let f : 

Then 


for every a G K. 


C be compactly supported, continuous and piecewise C^. 

ilPloo 


f{n)e(an) 


n^Tj 


< 


(sinTra)^ 


(2.3) 


As usual, the assumption of compact support could easily be relaxed to an assump¬ 
tion of fast decay. 

Proof. By the Poisson summation formula, 

00 OO 

Y f{n)e{an)= Y Jin-a). 

n— — oo n= —00 

Since/(f) = f'{t)/{2TTit), 

~ <^) _ Y^ /"(n - a) 

^ ^ 2TTi{n — a) ^ (27ri(n — a))^ 

By Euler’s formula tt cot stt = 1/s + + s) — l/(n — s)), 

„ 1 


E 


(n + s)" 


= — {tt cot stt)' = 


(sin stt) 2 


(2.4) 


Hence 


Y Tin-a) 


<irioo ^ 


1 


(27r(n — a))^ 


= 1 /" 


(27r)2 (sinaTr)^ 


□ 
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The trivial bound |/"|oo < l/"|i> applied to (2.3 i, recovers the bound in 0Taol41 
Lemma 3.1]. In order to do better, we will give a tighter bound for |/"|oo in Appendix 
l^when / is equal to one of our main smoothing functions (/ = 772 )- 

Integrals of multiples of /" (in particular, |/"|i and /") can still be made sense 
of when /" is undefined at a finite number of points, provided / is understood as a 
distribution (and /' has finite total variation). This is the case, in particular, for / = p 2 - 


When we need to estimate /(’^) precisely, we will use the Poisson summation 
formula; 

n n 

We will not have to worry about convergence here, since we will apply the Poisson 
summation formula only to compactly supported functions / whose Fourier transforms 
decay at least quadratically. 


2.4 Mellin transforms 


The Mellin transform of a function f : (0, 00 ) —>■ C is 

poo 

M(j){s) := / (2.5) 

Jo 

If (j){x)x'^~^ is in ii with respect to dt (i.e., \(j){x)\x'^~^dx < 00), then the Mellin 

transform is defined on the line a + iM. Moreover, if (j){x)x'^~^ is in ii for a = ai and 
for a = a 2 , where a 2 > <Ji, then it is easy to see that it is also in £i for all a G (cri, < 72 ), 
and that, moreover, the Mellin transform is holomorphic on {s : CTi < 3?(s) < tT 2 }- We 
then say that {s : cti < 3?(s) < cr 2 } is a strip of holomorphy for the Mellin transform. 

The Mellin transform becomes a Fourier transform (of by means 

of the change of variables X = We thus obtain, for example, that the Mellin 

transform is an isometry, in the sense that 

r°° fjr,. 1 r°° 

/ |/(x)|V"- = — / \Mf{a + it)\'^dt. (2.6) 

JQ ^ J— 00 


Recall that, in the case of the Fourier trans form , for I/I2 = I/I2 to hold, it is enough 
that / be in £i n £ 2 - This gives us that, for (2.6 1 to hold, it is enough that f{x)x'^~^ be 
in £i and f{x)x'^~^^^ be in £2 (again, with respect to dt, in both cases). 

We write / *m 9 for the multiplicative, or Mellin, convolution of / and g: 




/' x\ dw 
w 


(2.7) 


In general. 


M{f *M g) = Mf ■ Mg 


(2.8) 
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and 


-I n(7-\-i00 

M{f-g)is) = — Mf{z)Mg{s - z)dz 

ZTTZ J 


IIGR941 §17.32] (2.9) 


provided that z and s — z are within the strips on which Mf and Mg (respectively) are 
well-dehned. 

We also have several useful transformation rules, just as for the Fourier transform. 
For example, 

M{f{t)){s) = -{s-l)-Mf{s-l), 

M{tf'{t)){s) = -s-Mf{s), (2.10) 

M{{\ogt)f(t))(s) = {Mf)\s) 


(as in, e.g., lIBBOlOl Table 1.11]). 
Let 


t?2 — (2 • 1[1/2,1]) *M (2 • 1[1/2,1])- 


Since (see, e.g., MBBOIOI Table 11.3] or IIGR94I §16.43]) 


(M/, 


we see that 


Mt]2{s) = 


1 - 2 - 


Af 774(5) = 


1 - 2 - 


( 2 . 11 ) 


Let fz = e where 3?(z) > 0. Then 

r°° 1 

{Mf){s) = / = - 

Jo ^ Jo 


e ^dt 


— f / e-H^-^dt=^^. 


where the next-to-last step holds by contour integration, and the last step holds by the 
dehnition of the Gamma function r(s). 


2.5 Bounds on sums of /i and A 

We will need some simple explicit bounds on sums involving the von Mangoldt func¬ 
tion A and the Moebius function /r. In non-explicit work, such sums are usually 
bounded using the prime number theorem, or rather using the properties of the zeta 
function (((s) underlying the prime number theorem. Here, however, we need robust, 
fully explicit bounds valid over just about any range. 

For the most part, we will just be quoting the literature, supplemented with some 
computations when needed. The proofs in the literature are sometimes based on prop¬ 
erties of C(s), and sometimes on more elementary facts. 
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First, let us see some bounds involving A. The following bound can be easily 
derived from IIRS62I (3.23)], supplemented by a quick calculation of the contribution 
of powers of primes p < 32: 


< logo:. 


n 


( 2 . 12 ) 


We can derive a bound in the other direction from IIRS621 (3.21)] (for x > 1000, 
adding the contribution of all prime powers < 1000) and a numerical verification for 
a: < 1000: 


^ Ain) 3 

> -> logx - log 

^ n V2 

n<.x 


(2.13) 


We also use the following older bounds: 


1. By the second table in IIRR96i p. 423], supplemented by a computation for 
2 • 10® < y < 4 • 10®, 

^ A(n) < 1.0004y (2.14) 

n<y 


2 . 


for 2 / > 2 • 10®. 

Hn) < 1.038832/ 

n<y 

for every y > 0 IIRS621 Thm. 12]. 


(2.15) 


(2.16) 


For all y > 663, 

y2 

^ A(n)n < 1.03884y, 

n<y 

where we use ( 2.15| l and partial summation for y > 200000, and a computation for 
663 < y < 200000. Using instead the second table in I1RR961 p. 423], together with 
computations for small y < 10^ and partial summation, we get that 


^ A(n)n < 1.0008y (2.17) 

n<v 

for y > 1.6 • 10®. 

Similarly, 

V < 2 • 1.0004vA/ (2-18) 

n<y 

for all 2 / > 1. 

It is also true that 

(logp)^ < ^ 2 /(log 2 /) (2.19) 

y/2<p<y 
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for y > 117; this holds for y > 2 ■ 758699 by IIRS75I Cor. 2] (applied to x = y, 
X = y/2 and x = 2y/3) and for 117 < y < 2 ■ 758699 by direct computation. 

Now let us see some estimates on sums involving y,. The situation here is less 
satisfactory than for sums involving A. The main reason is that the complex-analytic 
approach to estimating J2n<N would involve 1 /C(s) rather than C{s)/C{s), and 

thus strong explicit bounds on the residues of 1/C(s) would be needed. Thus, explicit 
estimates on sums involving y are harder to obtain than estimates on sums involving A. 
This is so even though analytic number theorists are generally used (from the habit of 
non-explicit work) to see the estimation of one kind of sum or the other as essentially 
the same task. 

Fortunately, in the case of sums of the type J2n<x /^(^) /nfoix arbitrary (a type of 
sum that will be rather important for us), all we need is a saving of (log n) or (log n)^ 
on the trivial bound. This is provided by the following. 

1. (Granville-Ramare IIGR96I . Lemma 10.2) 


E 

n<x:gcd{n,q) — l 


F{n) 

n 


< 1 


( 2 . 20 ) 


for all X, q>l, 

2. (Ramare 0Raml3ll : cf. El Marraki MEM95I . nEM96l ) 


E 

n<.x 


F{n) 

n 


0.03 
log a; 


force > 11815. 

3. (Ramare BRambI ) 


E 


y{n) 


= O* 


n<rc:gcd(n,g) = l 

for all X and all q < x', 


1 4 q 


logx/q 5(()(g) 


E 


y{n) 


log - = O* 1.00303 


n<x;gcd(n,(3') = l 


n n 




( 2 . 21 ) 


( 2 . 22 ) 


(2.23) 


for all X and all q. 


Improvements on these bounds would lead to improvements on type I estimates, but 
not in what are the worst terms overall at this point. 

A computation carried out by the author has proven the following inequality for all 
real x < 10^^: 


E 

n<x 


n 



(2.24) 
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The computation was conducted rigorously by means of interval arithmetic. For the 
sake of verification, we record that 

5.42625 • 10"® < V < 5.42898 • 10"®. 

n 

n<10i2 

Computations also show that the stronger bound 


E 

n<.x 


T{n) 

n 


1 



holds for all 3 < a; < 7727068587, but not for x = 7727068588 — e. 

Ea rlier, numerical work carried out by Olivier Ramare IIRaml4l had shown that 
(2.24 1 holds for all x < 10^°. 


2.6 Interval arithmetic and the bisection method 

Interval arithmetic has, at its basic data type, intervals of the form I = [a/2^, 6/2^], 
where a,b,£ S Z and a < b. Say we have a real number x, and we want to know sin(a;). 
In general, we cannot represent x in a computer, in part because it may have no finite 
description. The best we can do is to construct an interval of the form / = [a/2^, b/2^] 
in which x is contained. 

What we ask of a routine in an interval-arithmetic package is to construct an interval 
/' = [aV2^ , b'/2^ ] in which sin(/) is contained. (In practice, this is done partly in 
software, by means of polynomial approximations to sin with precise error terms, and 
partly in hardware, by means of an efficient usage of rounding conventions.) This gives 
us, in effect, a value for sin(x) (namely, (o' + 6')/2^ and a bound on the error term 
(namely, {b' — a')/2^'+^). 

There are several implementations of interval arithmetic available. We will almost 
always use D. Platt’s implementation OPlal II of double-precision interval arithmetic 
based on Tambov’s BLamOSI ideas. (At one point, we will use the PROFIL/BIAS inter¬ 
val arithmetic package IIKnu99L since it underlies the VNODE-LP BNedOhl package, 
which we use to bound an integral.) 

The bisection method is a particularly simple method for finding maxima and min¬ 
ima of functions, as well as roots. It combines rather nicely with interval arithmetic, 
which makes the method rigorous. We follow an implementation based on BTucl 11 
§5.2]. Let us go over the basic ideas. 

Let us use the bisection method to find the minima (say) of a function / on a 
compact interval /q. (If the interval is non-compact, we generally apply the bisection 
method to a compact sub-interval and use other tools, e.g., power-series expansions, 
in the complement.) The method proceeds by splitting an interval into two repeatedly, 
discarding the halfs where the minimum cannot be found. More precisely, if we im¬ 
plement it by interval arithmetic, it proceeds as follows. First, in an optional initial 
step, we subdivide (if necessary) the interval Ig into smaller intervals Ik to which the 
algorithm will actually be applied. For each k, interval arithmetic gives us a lower 
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bound and an upper bound on {f{x) : x € Ik}\ here and are both of 

the form a/2^, a, I G Z. Let mo be the minimum of over all k. We can discard 

all the intervals Ik for which > mo. Then we apply the main procedure: starting 
with i = 1, split each surviving interval into two equal halves, recompute the lower and 
upper bound on each half, define m^, as before, to be the minimum of all upper bounds, 
and discard, again, the intervals on which the lower bound is larger than m^; increase i 
by 1. We repeat the main procedure as often as needed. In the end, we obtain that the 
minimum is no smaller than the minimum of the lower bounds (call them ir^'‘^)k) on 
(i) 

all surviving intervals Ij. . Of course, we also obtain that the minimum (or minima, if 
there is more than one) must lie in one of the surviving intervals. 

It is easy to see how the same method can be applied (with a trivial modification) 
to find maxima, or (with very slight changes) to find the roots of a real-valued function 
on a compact interval. 
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Part I 

Minor arcs 
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Chapter 3 

Introduction 


The circle method expresses the number of solutions to a given problem in terms of 
exponential sums. Let rj : R+ —^ C be a smooth function, A the von Mangoldt function 


(defined as in (1.51) and e(t) = The estimation of exponential sums of the type 


Sr^{a,x) = E A{n)e(an)r]{n/x), 


(3.1) 


where a G M/Z, already lies at the basis of Hardy and Littlewood’s approach to the 
ternary Goldbach problem by means of the circle method IHL22I . The division of the 
circle M/Z into “major arcs” and “minor arcs” goes back to Hardy and Littlewood’s 
development of the circle method for other problems. As they themselves noted, as¬ 
suming GRH means that, for the ternary Goldbach problem, all of the circle can be, 
in effect, subdivided into major arcs - that is, under GRH, can be estimated with 
major-arc techniques for a arbitrary. They needed to make such an assumption pre¬ 
cisely because they did not yet know how to estimate ST^{a, x) on the minor arcs. 

Minor-arc techniques for Goldbach’s problem were first developed by Vinogradov 
IIVin37l . These techniques make it possible to work without GRH. The main obstacle 
to a full proof of the ternary Goldbach conjecture since then has been that, in spite of 
gradual improvements, minor-arc bounds have simply not been strong enough. 

As in all work to date, our aim will be to give useful upper bounds on p.l| l for 
a in the minor bounds, rather than the precise estimates that are typical of the major- 
arc case. We will have to give upper bounds that are qualitatively stronger than those 
known before. (In Part|I^ we will also show how to use them more efficiently.) 

Our main challenge will be to give a good upper bound whenever q is larger than a 
constant r. Here “sufficiently good” means “smaller than the trivial bound divided by 
a large constant, and getting even smaller quickly as q grows”. Our bound must also be 
good for a = a/q + 5/x, where q < r but 5 is large. (Such an a may be said to lie on 
the tail {5 large) of a major arc (q small).) 

Of course, all expressions must be explicit and all constants in the leading terms of 
the bound must be small. Still, the main requirement is a qualitative one. For instance, 
we know in advance that a single factor of log a; would be the end of us. That is, we 
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know that, if there is a single term of the form, say, {x log x)/q, and the trivial bound 
is about X, we are lost; {x log x)/qis greater than x for x large and q constant. 

The quality of the results here is due to several new ideas of general applicability. 
In particular, §5.1| introduces a way to obtain cancellation from Vaughan’s identity. 
Vaughan’s identity is a two-log gambit, in that it introduces two convolutions (each of 
them at a cost of log) and offers a great deal of flexibility in compensation. One of the 
ideas presented here is that at least one of two logs can be successfully recovered after 
having been given away in the first stage of the proof. This reduces the cost of the use 
of this basic identity in this and, presumably, many other problems. 

There are several other improvements that make a qualitative difference; see the 
discussions at the beginning of Qand ^ Considering smoothed sums - now a com¬ 
mon idea - also helps. (Smooth sums here go back to Hardy-Littlewood IIHL22I - both 
in the general context of the circle method and in the context of Goldbach’s ternary 
problem. In recent work on the problem, they reappear in flTaol41 .) 


3.1 Results 


The main bound we are about to see is essentially proportional to ((log q)/\J4 >{q)) ' 
The term Jq serves to improve the bound when we are on the tail of an arc. 


Theorem 3.1.1. Let x > xq , xq = 2.16 • 10^°. Let Sr^{a,x) be as in {3.1 ( with rj 


defined in \3.4\ . Let 2a = a/q -f 5/x, q < Q, gcd(a, g) = 1 , \5/x\ < iJqQ, where 
Q — (3/4)x^. Ifq < then 


\Snia,x)\ < 


Rx,Soq^ogSoq + 0.5 


V^oq Ooq 


where 


(3.2) 


do = max(2, |d|/4), Rx,t = 0.27125 log 1 -f 


log At 


2 log 

^ 2.004t , 


0.41415, 


(3.3) 


Lx.t.q = ^ (^^logt + 7.82 ) + 13.66logf +37.55. 

Ifq > a;^/^/6, then 

|S'^(a,a:)| < 0.276a;®/®(log + 1234x2/® log a;. 

The factor R^.t is small in practice; for instance, for x = 102® and dog = 5 • 10® 
(typical “difficult” values), Rx.Sn a equ als 0.59648 .... 

The classical choic^for rj in (3.1 1 is 77(f) = 1 for f < 1, 77 (f) = 0 for f > 1, which, 
of course, is not smooth, or even continuous. We use 


t?(i) = m{t) = 4max(log2 - | log 2 f|, 0 ), 


(3.4) 


^ Or, more precisely, the choice made by Vinogradov and followed by most of the literature since him. 
Hardy and Littlewood IHL22I worked with 77 (f) = e“^. 
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as in Tao IITaol4l . in part for purposes of comparison. (This is the multiplicative con¬ 
volution of the characteristic function of an interval with itself.) Nearly all work should 
be applicable to any other sufficiently smooth function rj of fast decay. It is important 
that T] decay at least quadratically. 

We are not forced to use the same smoothing function as in Part [II] and we do not. 
As was explained in the introduction, the simple technique ( | 1 .40| i allows us to work 
with one smoothing function on the major arcs and with another one on the minor arcs. 


3.2 Comparison to earlier work 


Table|3.Ij compares the bounds for the ratio \Sri{a/q, x)\/x given by this paper and by 
IITaol4i rThm. 1.3] for x = 10^^ and different values of q. We are comparing worst 
cases; 4){q) as small as possible {q divisible by 2 • 3 • 5 • • •) in the result here, and q 
divisible by 4 (implying Aa ~ a/{q/A)) in Tao’s result. The main term in the result in 
this paper improves slowly with increasing x\ the results in 0Taol4l worsen slowly with 
increasing x. The qualitative gain with respect to the main term in 0Taol41 (1.10)] is in 
the order of log(g) \/(t>{q)/q. Notice also that the bounds in IITaol4l are not log-free; in 
IITaol4i (1.10)], there is a term proportional to xilogxY/q. This becomes larger than 
the trivial bound x for x very large. 

The results in IIDROll are unfortunately worse than the trivial bound in the range 
covered by Table 3.1| Ramare’s results ( IIRamlOl Thm. 3], ORamcl Thm. 6]) are not 


applicable within the range, since neither of the conditions log q < (l/50)(loga;)i/3, 
q < x^^^^ is satisfied. Ramare’s bound in BRamcl Thm. 6] is 


^ A{n)e{an/q) 


x<.n<2x 


< ISOOO^a; 

Hq) 


(3.5) 


for 20 < q < We should underline that, while both the constant 13000 and the 

condition q < ^1/48 keep ( |3.5| l from being immediately useful in the present context, 
0 is asymptotically better than the results here as q oo. (Indeed, qualitatively 
speaking, the form of ( |3.5| l is the best one can expect from results derived by the family 
of methods stemming from Vinogradov’s work.) There is also unpublished work by 
Ramare (ca. 1993) with different constants for q (log x/ log log x)^. 


3.3 Basic setup 

In the minor-arc regime, the first step in estimating an exponential sum on the primes 
generally consists in the application of an identity expressing the von Mangoldt func¬ 
tion A(n) in terms of a sum of convolutions of other functions. 

3.3.1 Vaughan’s identity 

We recall Vaughan’s identity 0Vau77bl : 


A = u<u * log +u<u * Acy * 1 -f /r >(7 * A>y *1-1- Acv, 


(3.6) 
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9o 

\Sjj{a/q,x)\ 

X ’ 

\Sr,{a/q,x)\ 

X ’ 


0.04661 

0.34475 

1.5 • 10’^ 

0.03883 

0.28836 

2.5 • 10’^ 

0.03098 

0.23194 

5 • 10^ 

0.02297 

0.17416 

7.5 • 

0.01934 

0.14775 

10® 

0.01756 

0.13159 

10^ 

0.00690 

0.05251 


Table 3.1; Worst-case upper bounds on x ^\STj{a/2q,x)\ for q > qo, |i5| < 8, x = 
10^^. The trivial bound is 1. 


where 1 is the constant function 1, and where we write 


f<z{n) 


f{n) if n < z 
0 if n > z, 


f>z{n) 


0 if n < z 
f{n) if n > z. 


Here f * g denotes the Dirichlet convolution (/ * g){n) = X]d|n 
set the values of U and V however we wish. 

Vaughan’s identity is essentially a consequence of the Mobius inversion formula 


(l*^)(n) = 


if n = 1, 
otherwise. 


Indeed, by (3.7 1 , 


A>vin) = ^ p(fi)A>v(m) 

dm\n 

= X! T<u{d)Ayv{rn) + ^ ^>c/(d)A>y( to). 

dm|n dm\n 


(3.7) 


Applying to this the trivial equality A>y = A — A<y, as well as the simple fact that 
1 * A = log, we obtain that 

A>v{n) = '^g,<u{d)\og{n/d) - ^ g.<u{d)A<v{'m) + ^ fj.>u{d)Ayv{m). 

d\n dm\n dm\n 


By Ay = A>y + A>y, we conclude that Vaughan’s identity (3.6 1 holds. 

Applying Vaughan’s identity, we easily get that, for any function g : 1 
completely multiplicative function / : > C and any x > 0, U,V >0, 


any 


A{n)f{n)e{an)g{n/x) = Sj^i - S '/,2 + S// + So, 00 , (3.8) 
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where 


51.1 = /i(m)/(TO) y^(log n)e{arnn)f{n)ri{rnn/x), 

m<U n 

51 .2 = ^ A(d)/(d) ^ fj.{m)f{m) E e(a(imn)/(n)y7((imn/x), 


d<v 


m<U 


( \ 


(3.9) 


S'// = fim) //(d) A{n)e{amn) f {n)r]{mn/x). 


m>U 


1 d>t/ I 

\ d|m / 


n>y 


>5'o,oo = X! A(n)e(an)/(n)77(n/a;). 


n<y 

We will use the function 


/w = 


1 if gcd(n, u) = 1, 
0 otherwise, 


(3.10) 


where u is a small, positive, square-free integer. (Our final choice will be u = 2.) Then 
S^(x, a) = S/_i — S/_2 + S// + So,oo + So,u,, (3.11) 


where S,;(x, a) is as in (3.1 1 and 


So,v = A{n)e{an)ri{n/x). 


The sums S/ i, S /,2 are called “of type I”, the sum Sjj is called “of type 11” (or 
bilinear). (The not-all-too colorful nomenclature goes back to Vinogradov.) The sum 
So,oo is in general negligible; for our later choice of V and rj, it will be in fact 0. The 
sum So,« will be negligible as well. 

As we already discussed in the introduction, Vaughan’s identity is highly flexible 
(in that we can choose U and V at will) but somewhat inefficient in practice (in that a 
trivial estimate for the right side of ( |3.11[ ) is actually larger than a trivial estimate for 
the left side of ( |3.11[ )). Some of our work will consist in regaining part of what is given 
up when we apply Vaughan’s identity. 


3.3.2 An alternative route 

There is an alternative route - namely, to use a less sacrificial, though also more in¬ 
flexible, identity. While this was not, in the end, the route that was followed, let us 
nevertheless discuss it in some detail, in part so that we can understand to what extent 
it was, in retrospect, viable, and in part so as to see how much of the work we will 
undertake is really more or less independent of the particular identity we choose. 









48 


CHAPTERS. INTRODUCTION 


Since ('{s)/C{s) = A(n)n ® and 

/C'(^)V^V (C'js) 

\as)J cisf ) 

^ _ K"is)cis) , 2 

c(s) c(s)2 U(^); 

= c'w /c'(^)y 

c(s) U(s);c(s) \as)J 

we can see, comparing coefficients, that 

A • log^ = ^ * log^ —3(A • log) *A — A*A*A, 

as was stated by Bombieri in IIBom76l . 

Here the term fj, * log^ is of the same kind as the term fj,<u* log we have to estimate 
if we use Vaughan’s identity, though the fact that there is no truncation at U means that 
one of the error terms will get larger - it will be proportional to x, in fact, if we sum 
from 1 to X. The trivial upper bound on the sum of A • log^ from 1 to a; is a;(log a;)^; 
thus, an error term of size x is barely acceptable. 

In general, when we have a double or triple sum, we are not very good at getting 
better than trivial bounds in ranges in which all but one of the variables are very small. 

This is the source of the large eri'or term that appears in the sum involving /i * log^ 

because we are no longer truncating as for fj,<u * log. It will also be the source of other 
large error terms, including one that would be too large - namely, the one coming from 
the term (A • log) * A when the variable of A • log is large and that of A is small. (The 
trivial bound on that range is ^ x log x.) 

We avoid this problem by substituting the identity A • log = /i * log^ —A * A inside 

ITni i: 

A • log^ = fi * log^ * log^) * A + 2A * A * A. (3.14) 

(We could also have got this directly from the next-to-last line in p.l2| l.) When the 
variable of A in (p, * log^) * A is small, the variable of p * log^ is large, and we can 
estimate the resulting term using the same techniques as for p * log^. 

It is easy to see that we can in fact mix ( |3.13| l and ( |3.14| i: 

A • log^ = P * log^ -3 ((A • log) * A>y + (p * log^) * A<y) 

+ (—A>y * A * A + 2A<y * A * A) 

for V arbitrary. Note here that there is some cancellation in the last term: writing 


(3.12) 


(3.13) 


= (—A>y * A * A + 2A<y * A * A) (n), (3.16) 

we can check easily that, for n = P 1 P 2 P 3 square-free with < n, we have 


Fzyin) 


{ -6 log Pi logp2logP3 

6 log Pi logP2 logP3 
12 log Pi logP2 logP3 


if all Pi > V, 

if Pi < P2 < 14 < P3, 

if Pi < 14 < P2 < P3, 

if all Pi < 14. 
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In contrast, for n square-free, —A * A * A(n) is —6 if n is of the form P 1 P 2 P 3 ,, and 0 
otherwise. 

We may find it useful to take aside two large terms that may need to be bounded 
trivially, namely, p * log<j( and (A • log)<u * A>v, where u will be a small parameter. 
(We can let, for instance, u = 3.) We conclude that 

A • log^ = Fpi^u{n) - 3Fi^2,v,uin) - iFH,v,u{n) + F^yin) + Foyu{n), (3.17) 


where 


Fpi,u = M*logtti: 

Fp2,V,u = {t* log^) * A<y, 

Fiiyuin) = (A • log)>„ * A>y, 

Fo,v,u{n) = p* log<j, -3(A • log)<„ * A>y 


and F^y is as in (3.161. 


In the bulk of the present work - in particular, in all steps that are part of the proof 


of Theorem 3.1.1 or the Main Theorem - we will use Vaughan’s identity, rather than 
p.l7| l. This choice was made while the proof was still underway; it was due mainly 
to back-of-the-envelope estimates that showed that the error terms could be too large 


if (3.14 1 was used. Of course, this might have been the case with Vaughan’s identity 
as well, but the fact that the parameters U, V there have a large effect on the outcome 
meant that one could hope to improve on insufficient estimates in part by adjusting U 
and V, without losing all previous work. (This is what was meant by the “flexibility” 
of Vaughan’s identity.) 

The question remains: can one prove ternary Goldbach using ( 3.17[ ) rather than 
Vaughan’s identity? This seems likely. If so, which proof would be more complicated? 
This is not clear. 

There are large parts of the work that are the essentially the same in both cases: 

• estimates for sums involving p<u * log^ (“type I”), 

• estimates for sums involving A>u * A>y and the like (“type 11”). 

Trilinear sums, i.e., sums involving A* A* A, can be estimated much like bilinear sums, 
i.e., sums involving A * A. 

There are also challenges that appear only for Vaughan’s identity and others that 
appear only for ( |3.17| l. An example of a challenge that is successfully faced in the main 
proof, but does not appear if (3.17|i is used, consists in bounding sums of type 


( \ 
/ 


U<m<xlW I d>U 
\d\m 


(In (5.1 we will be able to bound sums of this type by a constant times xjW.) Like¬ 
wise, large tail terms that have to be estimated trivially seem unavoidable in ( |3.17 1. 
(The choice of a parameter u > 1, as above, is meant to alleviate the problem.) 
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In the end, losing a factor of about \ogx/UV seems inevitable when one uses 
Vaughan’s identity, but not when one uses ( |3.17| l. Another reason why a full treatment 
based on ( |3.17| l would also be worthwhile is that it is a somewhat less familiar, and 
arguably under-used, identity and deserves more exploration. With these comments, 
we close the discussion of (3.17|i; we will henceforth use Vaughan’s identity. 









Chapter 4 

Type I sums 


Here, we must bound sums of the basic type 

//(to) e(amn)r7 

m<D n 

and variations thereof. There are three main improvements in comparison to standard 
treatments: 

1. The terms with to divisible by q get taken out and treated separately by analytic 
means. This all but eliminates what would otherwise be the main term. 

2. The other terms get handled by improved estimates on trigonometric sums. For 
large to, the improvements have a substantial total effect - more than a constant 
factor is gained. 

3. The “error” term 5/x = a — a/q is used to our advantage. This happens both 
through the Poisson summation formula and through the use of two alternative 
approximations to the same number a. 

The fact that a continuous weight p is used (“smoothing”) is a difference with respect 
to the classical literature ( IIVin37l and what followed), but not with respect to more 
recent work (including 0Taol4ll k using smooth or continuous weights is an idea that 
has become commonplace in analytic number theory, even though it is not consistently 
applied. The improvements due to smoothing in type I are both relatively minor and 
essentially independent of the improvements due to ([T]i and ([^. The use of a contin¬ 
uous weight combines nicely with but the ideas given here would give qualitative 
improvements in the treatment of trigonometric sums even in the absence of smoothing. 


4.1 Trigonometric sums 

The following lemmas on trigonometric sums improve on the best Vinogradov-type 
lemmas in the literature. (By this, we mean results of the type of Lemma 8a and 
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Lemma 8b in IIVin04l Ch. I]. See, in particular, the work of Daboussi and Rivat BDROll 
Lemma 1].) The main idea is to switch between different types of approximation within 
the sum, rather than just choosing between bounding all terms either trivially (by A) 
or non-trivially (by C/\ sin(7ran)p). There will alsc[*]be improvements in our appli¬ 
cations stemming from the fact that Lemmas |4. 1.1 1 and Lemma 4.1.2 take quadratic 
(I sin(7ran)p) rather than linear (| sin(7ran)|) inputs. (These improved inputs come 
from the use of smoothing elsewhere.) 


Lemma 4.1.1. Let a = ajq A fi/qQ, {a,q) = 1, |/3| < 1, q < Q. Then, for any 
A,C>0, 


E mm ( A, —— -^ 

V sm(7rQ;n)p 

V<n<y+q ^ ^ '' 


< min (2A+. (4.1) 


Proof. We start by letting mo = ly\ + [(g -b 1)/2J, j = n — mo, so that j ranges in 
the interval {—q/2,q/2]. We write 

+ C r / N r 1 

an = -b +02 mod 1, 

q 

where |5i(j)| and 1^21 are both < l/2g; we can assume 62 > 0. The variable r = 
aj -b c modg occupies each residue class modp exactly once. 

One option is to bound the terms corresponding to r = 0, —1 by A each and all 
the other terms by C/j sin(7ran)p. (This can be seen as the simple case; it will take 
us about a page just because we should estimate all sums and all terms here with great 
care - as in IDROll . only more so.) 

The terms corresponding to r = —k and r = k — 1 (2 < k < q/2) contribute at 
most 

1 ^ 1 

sin2 ^{k-l- qS2) ^ sin2 - f + " sin^ ^ (/c - i) + sin^ ^ (fc - f) ’ 

since x 1 —>■ is convex-up on (0, 00 ). Hence the terms with r f 0,1 contribute at 

most 



rq/2 J 

■ (^’ 


where we use again the convexity of a; 1 —l/(sina;)^. (We can assume q > 2, as 
otherwise we have no terms other than r = 0,1.) Now 



^This is a change with respect to the first version of the preprint iHelbl . The version of Lemma [4.1.l| 
there has, however, the advantage of being immediately comparable to results in the literature. 
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Hence 


V min ( A, — 


y<n<y+q 


c 


(sin Tran)' 


<2A + 


C 




„ 2.q TT 
+ C • — cot —. 

TT q 


Now, by llAS^ (4.3.68)] and llAS64l (4.3.70)], for t G (-tt, tt). 


^— 7 — 1 + ^ a2k+it^^'^‘^ — 1 + y + ■ ■ ■ 


f cot t = 1 - ^ b 2 k+it^''^‘^ = 1 T “ 7k “ 
^0 3 45 


(4.2) 


where a 2 fc+i > 0, & 2 fe+i > 0. Thus, for t G [0, fo]^ fo < tt, 

i \2 .2 ,2 

, sint 


— 1 + — + Co{t)t^ < 1 + ^ + Co{to)t‘^, 


(4.3) 


where 




which is an increasing function because a 2 k+i > 0. For to = 7'‘/4, co(to) ^ 0.074807. 
Hence, 




sin^ t 


+ fcot2f< (^l + -+co(-)f j + —j 


3 U 


2 3 


for t G [0,7r/4]. 

Therefore, the left side of (|4.11l is at most 


(i) 


= ;t-Tr+ co--— r <---<- 


45 


3 U 


2 3 


271 + C- ) • 2 =2Gl+2-Cg". 

\ TT / 2 -^2 


The following is an alternative approach; it yields the other estimate in (4.11. We 
bound the terms corresponding to r = 0, r = — 1, r = 1 by A each. We let r = ±r' 
for r' ranging from 2 to q/2. We obtain that the sum is at most 


/ 

3A + min A. 

2<r'<q/2 

+ ^ min I A, 

2<r'<ql2 


c 


V (sin|(r'-4-(7(52)) 

_ C _ 

(sin "qir' -\+ qS2) 


(4.4) 
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We bound a term min(A, (7/ sin((7r/(7)(r' — 1/2 ± q 62 ))^) by A if and only if 
C/ sm{{TT/q){r' — 1 ± q 52 ))'^ > A. (In other words, we are choosing which of the two 
bounds A, (7/| sin(7ran)p on a case-by-case basis, i.e., for each n, instead of making 
a single choice for all n in one go. This is hardly anything deep, but it does result in 
a marked improvement with respect to the literature, and would give an improvement 
even if we were given a bound B/ \ sin(7ran)| instead of a bound C/j sin(7rQ;n)p as 
input.) The number of such terms is 


< max(0, [(q/7r) C/A) =F qS 2 \), 


and thus at most {2q/'K) aics\n{yjC/A) in total. (Recall that q 52 < 1/2.) Each 
other term gets bounded by the integral of (7/ sin^fT ra/q ) from r' — 1 ± qS 2 (> 
(^/tt) axcsvi{^JC/A)) to r' ± q 62 , by convexity. Thus (4.4l is at most 


3A 


2 q 


A arcsin' 


2 q 


r9/2 


C 


' — arcsin -< 


C 2q 


re sin^ — 

A q 


dt 


< 3A H — -Aa,TCsm\ — H— -C\ — — 1 


A 


A 


C 


We can easily show (taking derivatives) that arcsin a; + x(l — x^) < 2x for 0 < 
a: < 1. Setting x = C/A, we see that this implies that 


3^ -I- — A arcsin \l'^ + —C\l^ — 1 < 3^ -f —VAC. 

C TT 


2 q 


A 


4q 


(If C/A > 1, then 3A + [4q/'K)\fAC is greater than Aq, which is an obvious upper 
bound for the left side of ( |4.1| ).) □ 

Now we will see that, if we take out terms with n divisible by q and n is not too 
large, then we can give a bound that does not involve a constant term A at all. (We are 
referring to the bound {2Q/3-K^)Cq^ below; of course, 2A + {4q/'iT)VAC does have 
a constant term 2^1 - it is just smaller than the constant term 3A in the corresponding 
bound in ( |4. l[ i.) 

Lemma 4.1.2. Let a = a/q + /S/qQ, (a, q) = 1, |/3| < I, q < Q. Let y 2 > yi> 0. If 
y 2 — yi V q and y 2 < Q/% then, for any A,C > 0 , 


yi<n<y2 

at" 


A, 


C 


sin(7rQ;n)p 


< min 


(|/c,^2.4+^^C^c), (4.5) 


Proof Clearly, an equals an/q+ {n/Q)l3/q\ since y 2 < Q/2, this means that \an — 
an/q\ < l/ 2 q for n < y 2 ', moreover, again for n < y 2 , the sign of an — an/q remains 
constant. Hence the left side of (|4.5|) is at most 



C 

(sin|(r-l/2))2 



C 


(sin ^r)2 

V q / 
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Proceeding as in the proof of Lemma |4.1.1 we obtain a bound of at most 
1 1 


C 


q TT q Hit 

(sin^)2 (sin|)2 TT q -K 2q ^ 


for g > 2. (If g = 1, then the left-side of ( |4.5[ ) is trivially zero.) Now, by ( |4.2[ |, 


(sinf)- 


+ ^cot2f< (^l + |+co(0f4^ 


^ 16f^ 

4\~ll 


45 


5 

< - 
- 4 


(f) 


4 \ ,4 ,5 


C0(T)-^)i^<7 


for t € [0,7r/4], and 

3f 


{sinty 


'6t I ^ U /TTN 4\ 2 / ^ at^ 

fcot-<fl + -+co(-)f 


8ir 
24.45 


5 / 1 

“ 3 (~6 




for f e [0,7r/2]. Hence, 
( I 1 


(sin^)2 (sin|)2 TT g tt 


q , TT g 37r \ / 2g 

— cot — I — cot — < — 


2g 


r < - 


g\2 5 


20 


’4"^ Vtt/ ’3 “ 37r2® ■ 


Alternatively, we can follow the second approach in the proof of Lemma |4.1.1| and 
obtain an upper bound of 2A -f (4g/7r)v/AC. 

□ 


The following bound will be useful when the constant A in an application of 


Lemma 4.1.2 would be too large. (This tends to happen for n small.) 


Lemma 4.1.3. Let a = a/q + jS/qQ, (a, g) = 1, \ j3\ < I, q < Q. Let j /2 > 2 /i > 0. Tf 
y 2 — yi q ond y 2 Q /2, then, for any B,C > 0, 


E ' 

yi<n<y 2 

ijfrt 


B 


C 


sin(7ran)| ’ | sin(7ran)|^ 


<2Bim«(2,log^). 


(4.6) 


The upper bound < (flBqj'K) log(2e^g/7r) is also valid. 

Proof. As in the proof of Lemma |4.L2| we can bound the left side of ( |4.6| l by 

C \ 


q/2 

2 min 


B 


r—1 


,sinf (r-i)’sm2|(r-4)y ’ 
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Assume Bsm{7r/q) < C < B. By the convexity of l/sin(f) and l/sin(f)^ for 
t e (0,7r/2], 



1 



c sin^ -t 


dt 


3 ( B ( log tan ( ^ arcsin ^ ) — log tan ^ ) + C cot arcsin ^ 


2q 


— B log cot --log 


C 


2q 


Now, for all t G (0,7r/2), 


1 


B - 


1 


^ - 672 ^ . 


+ - log cot f < - log — 


sinf t 

we can verify this by comparing series. Thus 
B 


t 


t 




+ ’iJlogcotf <B«log"i« 
TT 2q 7T TT 


for g > 2. (If q = 1, the sum on the left of ( |4.6| l is empty, and so the bound we are 
trying to prove is trivial.) We also have 

t log(f — y/t^ — 1) + y/t'^ — I < —t log 2t-\-t (4.7) 

for f > 1 (as this is equivalent to log(2f^(l — y/l — t~^)) < 1 — y/l — which we 
check easily after changing variables to 5 = 1 — y/l — Hence 


B 


sin 2^ 


B ( log cot ^ — log -- 7 ^ 

' 2q ^ B- y/B^ - 


TT TT 


B B log 


2B 

TT 


^ + y/B^ - 
Ce^q 


<B^log 

TT BtT 


for q>2. 

Given any C, we can apply the above with C = B instead, as, fo r any t > 0, 
mm{B/t,C/t'^) < B/t < mm{B/t, B (We refrain from applying (4.7 1 so as to 
avoid worsening a constant.) If C < B sin7r/g (or even if C < {'K/q)B), we relax the 
input to C = B sin ir/q and go through the above. □ 


4.2 Type I estimates 

Let us give our hrst main type I estimate]^ One of the main innovations is the manner 
in which the “main term” (m divisible by q) is separated; we are able to keep error 

^The current version of Lemma [4.2.l| is an improvement over that included in the first version of the 
preprint (Hib]. 
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terms small thanks to the particular way in which we switch between two different 
approximations. 

(These are not necessarily successive approximations in the sense of continued 
fractions; we do not want to assume that the approximation a/q we are given arises 
from a continued fraction, and at any rate we need more control on the denominator q' 
of the new approximation a'/q' than continued fractions would furnish.) 

The following lemma is a theme, so to speak, to which several variations will be 
given. Later, in practice, we will always use one of the variations, rather than the 
original lemma itself. This is so just because, even though ( |4.8| l is the basic type of 
sum we treat in type I, the sums that we will have to estimate in practice will always 
present some minor additional complication. Proving the lemma we are about to give 
in full will give us a chance to see all the main ideas at work, leaving complications for 
later. 


Lemma 4.2.1. Let a = ajq + Sjx, {a,q) = 1, \5/x\ < l/qQ^, q < Qo, Qq > 16. Let 
rj be continuous, piecewise and compactly supported, with Ir^li = 1 and rj" G Li. 
Lef Co > iV'loo- 

Let 1 < D < X. Then, if\d\ < l/2c2, where C 2 = (37r/5y4io)(l + the 

absolute value of 

»—^ ^ / m.n. \ 

(4.8) 


^ /r(m)^e(aTOn)?7 
m<D n 


is at most 


min 1 


Co 


(27r(5)‘ 


E 


p,{m) 


(m,g) = l 


■O* { CO ^ 




D 


TT^ / V 2xq 2x 


(4.9) 


plus 


^^D + 3c.5log+ 
TT q 


, , + D 


|ry'|i / coc^q^ \ (2y/?,coCi 3ci SScoCaX 


(4.10) 


where ci = 1 + \rj'\i/(2x/D) and M G [min((5o/2, D), D]. The same bound holds if 
|(5| > 1/2c2 but D < (3o/2. 

In general, if\5\ > l/2c2, the absolute value of ^4.8\ is at most {4.9\ plus 


2-^/ coCi 


+ (1 + e) min ^ 




+ I,2i9) ( +Jlog+^ 


, Q /^o , (1 + ^) 1 + ‘2d\ X 35coC2 

for e G (0,1] arbitrary, where rUe = s/S + 2e + ((1 + \/l3/3)/4 — 1)/(2(1 + e)). 


(4.11) 
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In (4.9l, min(l,co/(27r(5)^) always equals 1 when |(5| < l/2c2 (since (3/5)(l + 
\/l3/3) > 1). 


Proof. Let Q = [a;/|<5(7|J. Then a = a/q + 0*{l/qQ) and q < Q. (If 5 = 0, we 
let Q = oo and ignore the rest of the paragraph, since then we will never need Q' or 
the alternative approximation a! f Let Q' = [(1 + e)Q] > Q + 1. Then a is not 
ajq + O* (Xj qQ'\ and so there must be a different approximation a! jf, (a', g') = 1, 
f < Q' such that a — a'jq' + 0*{l/q'Q') (since such an approximation always 
exists). Obviously, |a/g — a'/q'\ > X/qq', yet, at the same time, |a/g — a!/q'\ < 
X/qQ + Xjq'Q' < X/qQ + 1/((1 + e)q'Q). Hence q'/Q + g/((l + e)Q) > 1, and so 
q' > Q-q/{X+e) > {e/{X+e))Q. (Note also that (e/(l+e))(3 > {2\5q\/x)-\x/5q\ > 
1, and so q' > 2.) 


Lemma 4.1.2 will enable us to treat separately the contribution from terms with 
m divisible by q and m not divisible by g, provided that m < Q/2. Let M = 
min((5/2, Z3). We start by considering all terms with m < M divisible by g. Then 
e{amn) equals e{{6m/x)n). By Poisson summation. 


e{amn)r){mn/x) = f{n), 

n n 


where f(u) = e{{dm/x)u)r]{{m/x)u). Now 


/(n) = [ e{—un)f{u)du = — [ e ((s — —) m) r]{u)du = —rj (—n — s') . 
J m J \\ mJ J m \m / 


By assumption, m < M < Q/2 < x/2\Sq\, and so \x/m\ > 2\Sq\ > 26. Thus, by 
( |2.1[ ) (with k = 2), 


y] f{n) = 


X 

m 


X 

m 


X , 


^ j 


v{-S) + 0* 5] 


= —vi-5) + 

m X {2 tt)‘^ 




O* I max ^ 


(4.12) 


Since x ^ 1/x^ is convex on K+, 


max > --^ = > - 


= TT^ - 4. 
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Therefore, the sum of all terms with m < M and q\mis 


m<M m<M 

q\m 

xp{q) 




m Co 

X (27r)^ 

T{m) 

m 


(^2-4) 


+ O* lJ.{qfco 


(m.q) = l 

1 1 w 

4 TT^ / I 2xq 2x) 


We will bound |77(—15)| by (2.1 1 . 


As we have just seen, estimating the contribution of the terms with m divisible by 
q and not too large (m < M) involves isolating a main term, estimating it carefully 
(with cancellation) and then bounding the remaining error terms. 

We will now bound the contribution of all other m - that is, m not divisible by q 
and m larger than M. Cancellation will now be used only within the inner sum; that 
is, we will bound each inner sum 


'mn\ 

X / ’ 


Tm{a) = ^ e{amn)r] (^- 

n 

and then we will carefully consider how to bound sums of \Tm{a) \ over m efficiently. 

m Co 


By (2.21 and Lemma 2.3.1 


\Tm{.a)\ < min 


1 


m 


w\ 


1 


2' ' |sin(7rmQ;)| x 4 (sinTrma)^ 

For any 2/2 > 2 /i > 0 with 2/2 — 2/i E 9 and 2/2 E Q/2, ( |4.13 1 gives us that 

C 


E \TM\< E 


yi<m<y 2 

q]m 


yi<m<y 2 

q\m 


A 


’ (sinTrmof)^ 


(4.13) 


(4.14) 


for A = (x/yi)(l + \ri'\i/{2{x/yx))) and C = {cq/ 4){y2/x). We must now estimate 
the sum 

E \TM\+ E (4.15) 

m<M ^<m<D 

q‘(m 


To bound the terms with m < M, we can use Lemma 4.1.2 The question is then 


which one is smaller: the first or the second bound given by Lemma 4.1.2? A brief 
calculation gives that the second bound is smaller (and hence preferable) exactly when 
\fCjA > (37r/109)(l + y^13/3). Since xJCjA ^ (^ 0 0 / 2 ) 771 / 3 ;, this means that 
it is sensible to prefer the second bound in Lemma 4.1.2 when m > c^xjq, where 

C2 = (3V5v^)(l + yW3). 

It thus makes sense to ask: does Q/2 < c^xjq (so that m < M implies m < 
c^xjqfl This question divides our work into two basic cases. 
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Case (a). 6 large: |(5| > l/2c2, where ci = (37r/5 Y^)( l + a/ 13/3). Then 
Q/2 < C 2 xlq\ this will induce us to bound the hrst sum in ( [4.15 i by the hrst bound in 
Lemma 14.1.21 


Recall that M = mm(Q/2, D), and so M < C 2 x/q. By (4.14i and Lemma 4.1.2 

^ ^ , IVIi 


^ \Tm{a)\ < ^ ^ min 

l<m<M j—0 jq<m<m\n{{j + l)q,M) 

q\rn q\m 


+1 


cq (j+i)g 

4 a; 

(sin TTmaY 


< 


20 00 ( 7 ^ 


37r^ Ax 

o<i<f 


(J + 1 ) < 




hcog 3 
37r^ 


— C9 
2 


20 Co <7^ 

4a; 

5coC2 


1 3 C2X 

o —^ o ^ 

2 q‘‘ 2 q^ 


^l< 


M- 


35coC2 

Btt^ 


-9, 


(4.16) 

where, to bound the smaller terms, we are using the inequality (5/2 < C 2 xlq, and 
where we are also using the observation that, since \5/x\ < l/qQrt, the assumption 
|(5| > 1 / 2 c 2 implies that q < 2 c 2 x/( 5 o; moreover, since q < Qq, this gives us that 
q^ < 2c 2X. In the main term, we are bounding qM"^/x from above by M ■ qQ(2x < 
M/26 < C 2 M. 

If D < {Q + l)/2, then M > \_D\ and so (4.16 1 is all we need: the second sum 


in (4.151 is empty. Assume from now on that D > [Q + l)/2. The hrst sum in (4.15 1 
is then bounded by ( |4.16| l (with M = Q/2). To bound the second sum in (4.151, we 
will use the approximation a'/q' instead of a/q. The motivation is the following: if 
we used the approximation a/q even for m > Q/2, the contribution of the terms with 
q\m would be too large. When we use a'/q', the contribution of the terms with q'\m 
(or TO = ±1 mod ( 7 ') is very small: only a fraction \/q' (tiny, since q' is large) of all 
terms are like that, and their individual contribution is always small, precisely because 
TO > Q/2. 


By (4.14 1 (without the restriction g -j" to on either side) and Lemma 4.1.1 

00 

" (a)l<E E \'^rn{a)\ 

Cm<D j—0 jq/^Q <Cm<m.in{(j-\-l)q'-\-Q/2,D) 

[ 


Q j2<m<D 


D-(Q + l)/2 


< 


< 


E 

7=0 

P-(Q + l)/2 j 

7=0 



where we recall that ci = 1 + | 77 '|i/( 2 a;/Z 7 ). Since q' > (e/(l + €))Q, 

D-(Q + l)/2 I 


E 

7=0 


jq 


- Q/2 


X 

7 


71 , 2x 

I Q+i t Q 


(1 + e)3 
eQ 


log"* 


D 

0+1 • 
2 


(4.17) 
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Recall now that g' < (1 + e)Q + !<(! + e)(Q + !)• Therefore, 

|^ D-i:Q + 1)/2 j _ 


E 

i=o 


'1 + - 


q' 


jq' + (Q + l)/2 


<q'\M + 


(1 + e)Q + 1 

(Q + l)/2 J 


D 

Q+i 


< qV3 + 2e + (D- 


Q +1 


q', + D 

2 '°® 


Q+l ■ 
2 


We conclude that J2Q/2<m<D \'^rn{c() \ is at most 

+ + «3 + 1 ) + (TL!22±i 


1 + 1* 


(4.18) 


log’' 


D 

Q+i 


+ 3ci ( 2 + iog+ 


D \ X 


/ Q 

2 / ^ 


We sum this to (4.161 (with M = Q12), and obtain that (4.15 1 is at most 
2^/^ (D + {1 + e){Q + 1) {w^ + \\og^ 


Q+l 

2 


,0 / o , (1 + ^) 1 ^ \ ^ , 35 coC2 

+ 3ci I 2 H-log .gq-j- I — H— q, 


(4.19) 


(4.20) 




where we are bounding 
5coC2 _ Scq 37r / 

and defining 

zu^ = \/3 -|- 26 “h 


= 4 « (1 

27r I 


1 + 


+ 




13 \ ^ 2v^ 1 
3 / “ TT '4 


1 + 


- 1 


2(1 + e) 


13 
(4.21) 


(4.22) 


(Note that < -s/S for e < 0.1741). A quick check against (4.16 1 shows that (4.20 1 
is valid also when D < Q/2, even when Q + 1 is replaced by min((5 + 1, 2D). We 
bound Q from above by x/\S\q and log’'" D/{{Q + l)/2) by log’*’ 2D/{x/\5\q + 1), 
and obtain the result. 

Case (b): |(5| small: |5| < l/2c2 or D < Qol2. Then min(c 2 x/g, H) < Q/2. We 
start by bounding the hrst q/2 terms in (4.15 1 by (4.13 1 and Lemma 4.1.3| 


E \Tm{a)\< Y. 


T.^qj2 


< 


m<ql2 

W\i 


hW\i 


cog/8a 


sin(7rTOQ;)| ’ | sin(7rTOa)p 


<7 max 2, log 


coe3g2 


(4.23) 


47r|p'|ix J 
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If < 2c2X, we estimate the terms with q/2 < m < c^x/q by Lemma 


which is applicable because min(c 2 a;/g, D) < Q/2\ 


4.1.2 


\Tm{oi)\ <Y Y 

(i-5)'?<m<(i+5)9 


^ <.m<D 
q]ni 


{j — q 2 ’ (sinTTwof)^ 


q\Tl 


< 


20 cog^ 
Stt^ Ax 


E 




J + o < 


20 co(7^ / C2a: D' 3 / C2X 


4a: \2g^ q 


5 

+ 8 


^^0 f _ r^, . ^ 5 q^\ ^ 5coC2 




^ ;;—n ( C 2 D' + 3c2(7 + —— I < 


4 X 


Gtt^ 




(4.24) 

where we write D' = min(c 2 x/g, Z?). If C 2 xlq > D, we stop here. Assume that 
C 2 xjq < D. Let R = max(c 2 a:/g, <7/2). The terms we have already estimated are 
precisely those with m < R. We bound the terms R < m < D hy the second bound 
in Lemma l4.Lll 


y: ir,„(a)i < y; y; 

R<m<D j—0 m>jq-\-R 

m<inin{{j-\-l)q-\-R,D) 

^ ^ 3cia; ^ Aq /ciCq 


Co (j+i)g+fl \ 

4 X \ 


jq + R^ (sinTTma)^ 


(4.25) 


j=0 


jq + R TT 


1 + - 


q 


jq + R 


(Note there is no need to use two successive approximations a/q, a'/q' as in case (a). 
We are also including all terms with m divisible by q, as we may, since |Tm(Q:)| is 
non-negative.) Now, much as before. 


E 


7=0 


and 




X 

X 


^ q 

2a; ^ 

1 X 

— + 


/ -at < mm 


— 

+ - 

R 

q , 

Jr i 

yc2’ 

q J 

' q 


log"* 


D 


C2xlq’ 


(4.26) 


E 

3=0 


1 + 


jq + R 




rD 


1 -f 


(4.27) 


^ D-R 1, + L> 

< Vi H-h - log ——. 

q 2 q/2 


We sum with ( |4.23| l and ( |4.24| i, and we obtain that ( |4.15| l is at most 


+ 3ci min —, — 

.C 2 q J 


q 2a: \ 55 coC 2 Ip'li 


127r2 


q + - - q ■ max 2, log 


CpV'q^ 

darlp'lia: 


(4.28) 
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where we are using t he fac t that 5coC2/67r^ < 2^coCi/7r to make sure that the term 
{5coC2/Qtt^)D' from (4.24 1 is more than compensated by the term —2-^coCiRItt com¬ 
ing from —R/q in (4.27 1 (by the definition of D' and R, we have R > D). W e can 
also use 5coC2/67r^ < 2.^coCi/7r to bound the term (5coC2/67r^)I?' from (4.24l by the 
term 2^cqC\D j-K in (|4.28 1 , in case c^xjq > D. (Again by definition, D' < D.) Thus, 
(4.28 I is valid both when c^xjq < D and when c^xjq > D. □ 


4.2.1 Type I: variations 

We will need a version of Lemma l4.2.1l with m and n restricted to the odd numbers. 
(We will barely be using the restriction of m, whereas the restriction on n is both (a) 
slightly harder to deal with, (b) something that can be turned to our advantage.) 


Lemma 4.2.2. Let a S K/Z with 2a = a/q + 6/x, {a,q) = 1, \5/x\ < I/qQo, 
q ^ Qo. Qo ^ 16- Let r] be continuous, piecewise and compactly supported, with 
\r]\i = 1 and rj" G Li. Let cq > |?7"|oo- 

Let I < D < X. Then, if\d\ < l/2c2, where C 2 = 67r/5y7io, the absolute value of 


p.{m) e{amn)r] (4.29) 

m<D n odd 

m odd 


is at most 


X . / , Co 


sy- 


plus 


E 


fi{m) 


m 


{m,2q) — l 


‘^y'CoCi 3ciX + D 

— -L> + —-log^- p- 

TT Z q C2x/q 


2|7'l 




, + D 

- -<7 log —fp 

TT q 2 


-q ■ max 1 


coe^q^ \ ( 2V3coCi 3ci 55coC2\ 


(4.30) 


(4.31) 


where Cl = 1 + |77'|i/(a;/i9) and M G [min((5o/2,79), 79]. The same bound holds if 
|(5| > 1/2c2 but 79 < (3o/2. 


In general, if\S\ > 1 /2c2, the absolute value of (^4.8\ is at most {4.30 \ plus 

2-^/ cqCi 


+ 2^1 2 + 


79 -(- (1 + e) min 

(1 + e) 


X 


l,2d) 


279 


\S\q) QO 


35coC2 


for e G (0,1] arbitrary. 


(4.32) 
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If q is even, the sum (4.30|l can be replaced by 0. 


Proof. The proof is almost exactly that of Lemma 4.2.1 we go over the differences. 
The parameters Q, Q', o', f and M are dehned just as before (with 2a wherever we 
had a). 

Let us hrst consider m < M odd and divisible by q. (Of course, this case arises 
only if q is odd.) For n = 2r + 1, 

e{amn) = e{am{2r + 1)) = e{2arm)e{am) 


= e —rm e -—h h — m 


2q 2x 2 


f5{2r + l) 

-1 

I 2a: 


a + Kqm\ , f5{2r + \) 


2 q 


= K e 


2x 


where k G {0,1} and k' = e((a + nq)/2) G {—1,1} are independent of m and n. 
Hence, by Poisson summation. 


n odd 


n odd 


e(amn)r/(mn/x) = k' e{{5rnl2x)n)r]{rnn/x) 

dd 


K 

Y 


(4.33) 


where f{u) = e{{dm/2x)u)r]{{m/x)u). Now 


w N X ^ / X S 

fit) = -vi—t- t; 

m \m 2 


Just as before, |a;/TO| > 2\5q\ > 26. Thus 
1 


XI H + 1/2) 


m 1 2 


-E 

2 

n^O 


X n 5 
m2 2 


a: I 1 
m 1 2 


X 

2m 








The contribution of the second term in the last line of (|4.34|l i 


(4.34) 


IS 




m<M 
m odd 
q\m 


Q Co /2 

X 2'k'^ 


m 


qco f 1 1 


m<M jq 
m odd 


M 


*8 27r2 * ' ^ 


X 
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Hence, the absolute value of the sum of all terms with m < M and q|m is given by 

( |430l l. 

We define Tm,o{a) by 


Tm,o{a) = ^ e{amn)ri ■ 

n odd 


(4.35) 


Changing variables by n = 2r + 1, we see that 


\Tm,o{a)\ = 




e(2a • mr)rj(rn[2r + I)/x) 


Hence, instead of (4.13 i, we get that 


w\ 


m Co 


1 


|T;„.o(a)| <min( + 2 | sin(2^TOa)r a; 2 (sm2^TOa)2 


(4.36) 


We obtain ( 4.14| i, but with Tm,o instead of T^, A = {x/2yi){\ + \'ri'\i/{x/yi)) and 
C = {co/2){y2/x), and so Ci = 1 + \ri'\i/ {x/D). 

The rest of the proof of Lemma pl.2.1 [ carries almost over word-by-word. (For the 
sake of simplicity, we do not really try to take advantage of the odd support of m 
here.) Since C has doubled, it would seem to make sense to reset the value of C 2 to be 
C 2 = (37r/5-y2co)(l + \/l3/3); this would cause complications related to the fact that 
5coC2/37r2 would become larger than 2y4io/7r, and so we set C 2 to the slightly smaller 
value C 2 = 67r/5y4io instead. This implies 


5coC 2 _ 2y^Co 
37r2 TT 


(4.37) 


The bound from ( |4.16 1 gets multiplied by 2 (but the value of C 2 has changed), the 
second line in (4.191 gets halved, ( |4.21| ) gets replaced by (4.37 1 , the second term in 
the maximum in the second line of ( |4.23| l gets doubled, the bound from ( |4.24| ) gets 
doubled, and the bound from (4.26 1 gets halved. □ 


We will also need a version of Lemma |4.2.1| (or rather Lemma [4.2.2[ we will decide 
to work with the restriction that n and m be odd) with a factor of (log n) within the 


inner sum. This is the sum Si,i in (3.91. 


Lemma 4.2.3. Let a S M/Z with 2a = a/q + 5/x, {a,q) = 1, |5/a:| < I/qQo, 
Q < Qoy Qo ^ max(16, 2-^0;). Let rj be continuous, piecewise and compactly 
supported, with |p|i = 1 and rj" € Li. Let cq > |??"|oo- Assume that, for any p > p^, 
Po a constant, the function rijp){t) = \og{pt)rj{f) satisfies 

|t?(p)|i < log(p)|77|i, < log(p)|77'|i, |7?"^)|oo < colog(p) (4.38) 

Let v/S < D < inm{x/po,x/e). Then, if\S\ < l/2c2, where C 2 = 67r/5y4io, the 
absolute value of 

(logn)e(amn)p (4.39) 


m<D 
m odd 


i odd 
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is at most 


^ ■ (. Co/^2\ 

V log * 

m mq 

m<^ ^ 

+ ^|log-?7(-i5)| 

dim) 

m<^ 


{m,q) = l 


{m,q) = l 


(4.40) 


+ O* ( Co ( ^ 1 ( ^ log 


iqx 


1d~ 


plus 

2-\/Co Cl 


+ 


ex 3cix + D q 

Dlog — + -- log^-- log — 

TT U 2 q C 2 x/q C 2 

(M 


5^) ‘“'S" + ^ 


^ + V^logg^ 

2 V C2 C2 OTT^ C2 


/or Cl = 1 + |r7'|i/(a;/I?). 77ze same bound holds if\S\ > l/2c2 D < (5o/2- 
In general, if\S\ > l/2c2, f/ze absolute value of ^4.39\ is at most 

2VcoCi „, ex 
--C)log7T + 

TT U 


(4.41) 


2\/ cqCi 


3ci 


(1 + e) ^\/3 + 2e • log+ 2\/e|(5|(7 + ^ log+ ^ log+ 2\5\q 

logx^ + yA/2coC2^^^ v^logx 


2 

7! 


l + e. 
2e 


/or e e (0,1]. 


(4.42) 


Proof. Define Q, Q', M, a' and q' as in the proof of Lemma |4.2.1| The same method of 
proof works as for Lemma 4.2. ![ we go over the differences. When applying Poisson 
summation or (2.2i, use r](x/m){t) = {\ogxt/m)rj{f) instead of p(f). Then use the 
bounds in (4.38|l with p = x/m\ in particular, 


Wlx/m)\^ < Co log 


m 


For f{u) = e{{5m/2x)u){\ogu)rj{{m/x)u). 
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X n 
2 

m / x\ Co , 2 . N 

2mV^° 'V 2y'--Vm;'v 2yy + 7r«™js?<’'-“>■ 

The part of the main term involving log(a:/m) becomes 


xri{-5) ^i{m) \ 

2 m Vm/ 

m<M 
m odd 
q\m 


’ ”* ^">97 

(77i,2g) —1 


for g odd. (We can see that this, like the rest of the main term, vanishes for m even.) 
In the term in front of tt^ — 4, we find the sum 


E m , / X \ M , X q 


rM/q 


m<M 
m odd 
q\m 


, x/q , 
flog ~~^dt 


= 1 / 2 , 


M , X , e"'^x 

= ~ loS T7 + 7— ^17“’ 

a; M Aqx M 

where we use the fact that t ^ t log(a:/f) is increasing for f < x/e. By the same fact 
(and by M < D), (M^/g) log(ei/^x/M) < (ZJ^/g) log(ei/^x/Z)). It is also easy to 
see that (M/x) \og{x/M) < 1/e (since M < D < x). 


The basic estimate for the rest of the proof (replacing (4.13 i) is 


Tm,oia) = ^ e{amn){logn)ri (^) = (~) 

n odd 

7|l/(a;/m)ll m \Wlx/m)\°° 


n odd 


I . I ^ I I , l^(a:/m)l 

— O I min I — '^(rr/m) 1 ^ ’ I * /O M ’ / ‘ O ^2 

' ' 2m ^ ’ 2 |sm(27rma)| x (sinzTrma)^ 


(- 


= O* log — • min 

y m \2m 


w\ 


llvll 


m Co 


I sin(27rma)| ’ x 2 (sin27rma)^ 


We wish to bound 


'y^. \Tm,o{o)\ + \Tm,o{oi)\. 

m<M ^<m<D 

q\na 
m odd 


(4.43) 


Just as in the proofs of Lemmas 4.2.1 and 4.2. 2[ we give two bounds, one valid for 
|(5| large (|(5| > l/2c2) and the other for S small (|(5| < Il2c2). Again as in the proof 
of Lemma[4.2.2| we ignore the condition that m is odd in (|4.15[). 
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Consider the case of |5| large first. Instead of (|4.16 1 , we have 


Since 


E \Tm{a)\< 

l<m<M 

(jfm 


E - 


40 coq^ 

Stt^ 2x 


E (■?' + 


o<i<f 


J9 + 1 


(4.44) 


o<j<^ 


jg + 1 

Ad X \ > X 

< log X + — log — + > log — 

q M iq 

i<i<- 


E 


i<i< —-1 


jq 


, M, X /'«, X, f “ , X , 

< log X H-log 77 + / log —at + t log —at 


M 


tq 

, /2M M2\ e^/^x 


tq 


this means that 


E \Tmia)\< 


l<m<M 

gfm 


40 coq^ 
Stt^ 4x 


logx - 


/2M e 

I -^-1 log - 


V 9 2q^ 


M 


(4.45) 


- + y V^cocEv^logx, 


where we are using the bounds M < Q/2 < C 2 xlq and q^ < 2c2X (just as in (4.16 1 ). 
Instead of (|4.17[), we have 


P-(0 + l)/2 


E 

i=o 


log 


/ j_ Q+1 


< 


^ / m' + ^ <5/2 


, 2x X f 


^ , X dt 

logTTT 

Q+i f f 


2x , + 2D 


2x, 2x X , _ . 

- 7T 7T + D l°g 77 log TC; 

Q Q q' Q Q 


recall that the coefficient in front of this sum will be halved by the condition that n is 
odd. Instead of (|4.18|l, we obtain 


|^ D-(Q + l)/ 2 j 


E 


'1 + 


9' 


log- 


j=0 


jV + (Q + 1 )/ 2 \ E 9 ' + ^ 


< ,V3+2e. log (^1+1) (log I) 

< q \/3 + 2e • log + D log — 

<3 + 1 , 2ex g' 2x , 2D 
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(The bound log{x/t)dt/t < \og{x/a) \og{b/a) will be more practical than the exact 
expression for the integral.) Hence 'YliQ/ 2 <m<D l^m(cr)| is at most 


2VcoCr ex 

— - D\og — 

TT D 


+ ((1 + e)V3 + 2e + log (Q + 1) log 


2x 

Q +1 


Si/cqCi Q + l. 2ex 3ci / 2 1 + e . 


Summing this to (4.45 1 (with M = Q/2), and using (4.21 1 and (4.221 as before, we 
obtain that (|4.43|l is at most 


2VcoCr ex 

— - Dlog — 

TT D 


, 2D , 2x 


+ + e)(Q + 1) ^v/3 + 2elog+ ^ log 

+ ^ + ^-^log+ v^logVa;+ yy2coC2^^v^loga;. 

Now we go over the case of |5| small (or D < Qq/2). Instead of ( |4.23[ ), we have 
^ |T^,o(a)| < ^gmax^l,log logo;. (4.46) 

m<q/2 ^ I / |1 / 

Suppose < 2c2X. (Otherwise, the sum we are about to estimate is empty.) Instead 
of ( |4.24| i, we have 

40 coq^ 


\Tm.,o{od)\ < 


^ <.m<D' 
gtm 


Stt^ 6x 


E 0 + ) 


(j -1) 9 


< 


< 


< 


lOcog^ 

Stt'^x 

lOcog^ 

3tt‘^x 

5coC2 

Stt^ 

5coC2 

Stt^ 


( 2x 1 f^ , a; 1/^1 a; D' a; \ 

log—+ -/ \og-dt+- tlog-dt-\ -log 7 ^ 

\ q q Jo t qJo t q d' j 


, 2a; f2D' Je:L 

log — + - ' ^ ^ \ V 


q 2 g 2 




^4V2c2a; log y + 4^20^log + D' log 

' log + 4v'2c2X log 

dJ' C2 J 


(4.47) 

where D' = mm{c 2 x/q, D). (We are using t he bo unds q^/x < ( 2 c 2 )^/^, D'q^ /x < 
C 2 q < and D'q/x < C 2 .) Instead of (4.25 i, we have 


L^ 9 ^J/ 3 ci . I 

i: iT„(o)i < ^ 


R<m<.D 


i=o 


jq + R TT 


1 + ^ 


log — 


jq + Rj jq + R 
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where R = max{c 2 x/q, q/2). We can simply reuse ( 4.26| l, multiplying it by logx/R; 
the only difference is that now we take care to b ound m infg/c^ ,2xjq) by the geometric 
mean \/ {q/C2){2x/q) = \f2xjc2. We replace (4.27 1 by 


E 


1 + 


jq- 


R 


, X / g , X 1 , 

log . , „ < W1 + - log - + - / 

jq + R V R R qjR 


D 


/-, q f D ^ ex R, 

< V3log — + — log —-log — 

C 2 \q D q R 


1, q, +D 
-log —log^ —. 
2 C2 R 


We sum with (4.46 1 and (4.47 1 , and obtain (|4.41|l as an upper bound for (|4.43 1. (Just 


in the proof of Lemma 


4.2.1 


<7 , X , 
1 + - \og-dt 
t ^ t 


(4.48) 


; as 


the term (5coC2/(37r^))ZJ'log(-\/ex/i7') is smaller than 


the term {2^c\ cq/tt) i? log exjR in (|4.48| , and thus gets absorbed by it when U > R. 
If U < i?, then, again as in Lemma|4.2.1 the sum 'YliR<ra<D \'Rm,o{,C!i) \ is empty, and 
we bound (5coC2/(37r^))i7'log(i/ex/U') by the term (2^01 cq/tt)!? log ex/D, which 
would not appear otherwise.) □ 

Now comes the time to focus on our second type I sum, namely, 

E Af”) E E e{avun)Ti{vun / x) ^ 


V odd 


\i<U 
Li odd 


i odd 


which corresponds to the term S '/2 in (3.9i. The innermost two sums, on their own. 


are a sum of type I we have already seen. Accordingly, for q small, we will be able to 
bound them using Lemma [4.2.2| If g is large, then that approach does not quite work, 
since then the approximation av/q to va is not always good enough. (As we shall later 
see, we need q < Q/v for the approximation to be sufficiently close for our purposes.) 

Fortunately, when g is large, we can also afford to lose a factor of log, since the 
gains from g will be large. Here is the estimate we will use for g large. 


Lemma 4.2.4. Let a S K/Z with 2a = a/q + 5/x, {a,q) = 1, |5/x| < l/gQo> 
g E Qo, Qo E max(2e, 2y/x). Let rj be continuous, piecewise and compactly 
supported, with j/yli = 1 and r}" € Li. Let cq > |p"|oo- Let C 2 = 67r/5y7io. Assume 
that X > 6 ^ 02 / 2 . 

Let U,V > 1 satisfy UV + (19/18)(5o E x/5.6. Then, if\6\ < l/2c2, the absolute 
value of 


E A!-) E p.{u) E e{avun)r]{vun/x) 


v<V 
V odd 


u<U 
u odd 


n odd 


(4.49) 


is at most 




+ 0-11 - 4 


'D^logV iU + lfV, 

Co 1 -— + ^— logg 


2gx 


2x 


(4.50) 
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plus 


TT V ve V 7 2 

2|7'|i 


3ci X, , D 

+ — - log L> log+-- 

2 g C 2 X q 


+ 


-g max 1, log 


D 

qj^ 

cpe^q^ 

' 4'K\rj'\iX 


, 9 

log 2 


(4.51) 




where D = UV and ci = l + \p'\i/{2x/D) andc 4 = 1.03884. The same bound holds 
if\5\ > 1/2 c2 butD< Qo/2. 

In general, if\6\ > l/2c2, the absolute value of {4.49) is at most {4.50\ plus 


?^Dlog® 

TT e 


2'y/co^i 


(Rig + 2e - 1) log 


.^^ + ilogZ71og+^ 

V2 2 ^ 


3ci /^1 3(1+ e) 

16e 


20 co 


logx ) + -^( 2 c 2 r^ I Vallog 


for e e (0,1]. 


(4.52) 


Proof. We proceed essentially as in Lemma 4.2.1 and Lemma 4.2.2 Let Q, q' and Q' 
be as in the proof of Lemma[4.2.2 that is, with 2a where Lemma 4.2.1 uses a. 


Let M = min({7T/, Q/2). We first consider the terms with uv < M, u and v odd, 
UV divisible by g. If g is even, there are no such terms. Assume g is odd. Then, by 
(|4.33|l and (|4.34|l, the absolute value of the contribution of these terms is at most 


( 

E 

a<M Lj|a 

a odd \a/U<v<V 
q\a 


\ 




/ 


xvi-5/2) 

2a 


O - 


a jry" 


X 271^ 


(tt" - 4) . (4.53) 


Now 


E E 

i<M v\a 

a odd n./U<v<V 


A{v)ii{a/v) 


q\a 


= E 


A(x) 


E 


p.{u) 


V u 

v<V u<m\n{U,M/V) 

V odd u odd 

(«,9) = 1 q\u 


E 

p^<V 

p odd 

pk 


A(j)° 


E 


p(m) 


u<min(U,M /V) 
u odd 

7—1'^ 

(q,p“) I 
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which equals 


T{q) Hv) 


0 V u 

v<V u<min(U/q,M/Vq) 

V odd 
(v,q) = l 


E 

iU/q,- 

(n,2g) = l 


fl{u) 


+ 




E 


A(p“) 


E 


t(u) 


p°:<V^ '' U M/V \ 

po”dd 

. u odd 

V ’ (q,P°‘) J 


= - O* 

q 


\ 


E 

v<V 

iv,2q)^l 


A{v) 


E 

p^<V 
p odd 

p\q 


logp 


p°‘/iq,p°‘) 




where we are using (2.20i to bound the sums on u by 1. We notice that 


y < y(iogp) 

po.<v ^ > podd 


i 


Aq) + E 


V 


a>Vp(q) 

p“<y 


P' 


1 

,a-Vp(q) 


/ 


p odd p\q 

p\q 

<logg+y(logp) y ^<logg+ y 
p odd /3>0 v<V 

p\q pl ^<— 

{v,q) = l 


and so 


E E 

a<M v\a 

a odd a/U<v<V 
q\a 


Aiv)p{a/v) Q* 


logg+ y 

v<V 

. {v,2)=l 


A{v) 


= - ■ 0*(}ogq + log 17) 


by (2.12 1 . The absolute value of the sum of the terms with f}{—6/2) in (4.53 i is thus at 
most 


x v{-S/2) 

q 2 


(logg + logy)< 


Cq 

{nSy 


log 17^, 


where we are bounding rj{—5/2) by (2.1 1 (with k = 2). 
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The other terms in (|4.53|l contribute at most 


Y.Y.Mv)uv. 

u<Uv<V 
uv odd 

uv<M, q\uv 
u sq-free 


(4.54) 


For an y R, J2u<R,uodd,q\u - + 377/4 . Using the estimates (2.12i, (2.15 i 

and (|2.16|), we obtain that the double sum in (|4.54|i is at most 


v<V u<m.in{U,M/v) p^<V u<U 

{v,2q) — l odd p odd w odd 

9 l“ p\q (,,p“) K 

«—T I I /1 


< 


^ A(u)u- ( —J + ^ (logp)p 

V<V \ y / 

iv,2q)=l 

7V7^ log V 3 c4 


(u + iy 


(4.55) 


4q 


po<v 

p odd 

p\q 

^-MV+ ^ ^ ^ Ulogg, 


where C 4 = 1.03884. 

From this point onwards, we use the easy bound 


Mv)p{a/v) 

v\a 

a/U<v<V 


< log a. 


What we must bound now is 


E 

m<UV 


(log m) E e{amn)ri{mn / x). 

n odd 


m odd 

q \ m 01 m > M 


(4.56) 


The inner sum is the same as the sum Tm,o{oL) in (4.35 1 ; we will be using the bound 
( 4.36| l. Much as before, we will be able to ignore the condition that m is odd. 

Let D = UV. What remains to do is similar to what we did in the proof of Lemma 
|4.2.1| (or Lemma |4.2.2| l. 

Case (a). S large: |(5| > l/ 2 c 2 . Instead of (4.16 1 , we have 


(logm)|r™,o(a)| < Y (J + 1 ) log(i + 1)7, 

1<?7i<M o<j< — 
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and, since M < min(c 2 a;/g, D), q < y/2c2X (just as in the proof of Lemma 4.2.1 1 and 
(j + 1) log(j + l)g 


o<i<f 


M fM \ I 

< — logM+ — + 1 log(M +1) + ^ / tlogtdt 

9 \Q J r Jo 


M 


m2 , M 


< (2- + l)logx+^log^, 


we conclude that 

Y \'^rn,oia)\ < ^^Mlog^ + ^^(2c2)^/^^/iloga;. 


l<m<M 

q]m 


(4.57) 


Instead of ( |4.17| l, we have 

I D-(Q + l )/2 
L q/ J 

E 


3=0 


jq' + ^ 




^ X , Q + l X 

< ^ log — 

2 


rD 


_ log^ 

q' Js±l t 


dt 


2x Q , (l + e)x ^^2 f^^^Q 


< -log-h 

- Q ^2 2eQ 


(logD)2 - log- 


Instead of ( |4.18| ), we estimate 

[^J 

q' Y ( 


3=0 


Q +1 


+ jq 


n + 




q' 


jq' + ^ 

rD 


< q' ^logD + (V3 + 2e - 1) log ^ ^ logt dt + 

< q' ^logD + (\/3 + 2e- l) log ^ ^ ^ ^ ^ 

+ |logDlog+^. 


^ g'log t 
2t 


dt 


D Q + 1, <3 + 1 

■D log-— log —— 

e z Ze 


We conclude that, when D > Q/2, the sum X!Q/ 2 <m<D(^^S^)l^"i('^)l niost 

^£>log ^ + {Q + 1) ^(1 + e)(\/3 + 2e - 1) log 
VooCi 


Q + 1 1, Q + 1 


((3 + l)(l + e)logDlog'' 


e^D 

Q+i 

2 


3ci / 2x Q (1 + e)x / 


(log D)"- I log 2 
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We must now add this to ( |4.57| l. Since 

(1 + e)(-\/3 + 2e — 1) log V2 — ^ log 2e H- ^ \og2y/e > 0 


and Q > 2^/x, we conclude that (|4.56|l i 


IS at most 


?^Dlog:7 

TT e 


?^^(l + e)(Q + l) |^(v/3T^-l)log^ + ilogZ71og^ 


e^D 

Q+l 


3ci fl 3(1+ e) 
16e 


logx^ + Vi; log x. 


(4.58) 

Case (b). S small: |(5| < l/2c2orD < Qol2. The analogue of (4.231 is a bound of 


< 


2|V 




for the terms with m < q/2. If q'^ < 2c2X, then, much as in (4.24 1 , we have 


X] |T’m,o(a)|(logTO) < 


^ <m<D' 
q]m 


10 Cpq^ 
TT^ 3x 


10 Cog 


E 


J + 2 ) 


(4.59) 


r-£>' + §</ 


< ^— / x\ogx dx. 

TT oX j f. 


Since 


fD' + %q 


= T^D'^ + T^D'q] log 


3 

V 

D' 3 g 

Ve 2l2' 


a; log a; dx = i ( D' + ^g ) log- jJ --g^ log ^ 


D' , 3^, 


= 7 ;D log + -D'q log -D' + -g^ - + - + log D' + —g 


sfe 2 


8 


sfe 


Oa, D + ^2] 7 

log^^--, log^ 
19 


18 


where D' = min(c 2 x/g, 79), and since the assumption (UV + (19/18)(5o) ^ x/5.6 
implies that (2/9 + 3/2 + log(i9' + (19/18)g)) < x, we conclude that 


< 


17)71.0(0) I (log to) 

f <m<D' 
q\m 

3CoC2 ^^( 2 c 2 )^/^VxlogX + ^ ( 2 C 2 )^/^ Vx log X^ 


37r2 
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Let R = max(c 2 a;/ 9 , ( 7 / 2 ). We bound the terms R < m < D as in (4.25 1 , with a 
factor of log{jq + R) inside the sum. The analogues of (4.26 1 and (4.27 1 are 


E 

i=o 


jq + R 


pD 

log(j 9 + 7?) < ^ log i? + - / 

R q Jr 


\ogt 


dt 


< 


log 


C 2 X X , ^, I D 

- + -logDlog+-, 


where we use the assumption that x > e^c/ 2 , and 


E log0'9 + ^)\/l 

i=o 


. n < VSlogR 
jq + R 


1 


D 


R 


- ( Dlog— -iJlog— ) + -logDlog — 


1 


D 


(4.61) 


(4.62) 


(or 0 if 77 < R). We sum with (4.60 1 and the terms with m < q/2, and obtain, for 
D' = C2x/q = R, 


2-^/ CqCi 


D 


77 log ^ + g ( a/s log- 


Ve 


C 2 X log 77 


77 \ 


2 


3ci a; + 77 

— - log 77 log^- - 

2 q C 2 X q 


2|7'l 


■ q max ( 1 , log 


cpe^g^ 

AT:\ri'\ix 


1 7 

log 5 


+ * 5(202)“-'"v^log-. 


which, it is easy to check, is also valid even if 77' = 77 (in which case (4.61 1 and (4.62 1 
do not appear) or R — q/2 (in which case (4.601 does not appear). □ 


























Chapter 5 

Type II sums 


We must now consider the sum 

/ \ 


Sii = 


m>U 


A{n)e{amn)ri{mn/x). (5.1) 


d>U 


n>V 


{in,v) — l Vftim / {n,v) — l 

Here the main improvements over classical treatments of type II sums are as fol¬ 
lows; 


1 . obtaining cancellation in the term 

Y 

d>U 

d\m 

leading to a gain of a factor of log; 

2 . using a large sieve for primes, getting rid of a further log; 

3. exploiting, via a non-conventional application of the principle of the large sieve 
(Lemma [5.2.1[ ), the fact that a is in the tail of an interval (when that is the case). 

It should be clear that these techniques are of general applicability. (It is also clear that 
Q is not new, though, strangely enough, it seems not to have been applied to Gold- 
bach’s problem. Perhaps this oversight is due to the fact that proofs of Vinogradov’s 
result given in textbooks often follow Linnik’s dispersion method, rather than the large 
sieve. Our treatment of the large sieve for primes will follow the lines set by Mont¬ 
gomery and Montgomery-Vaughan 0MV73I (1.6)]. The fact that the large sieve for 
primes can be combined with the new technique Q is, of course, a novelty.) 

While Q is particularly useful for the treatment of a term that generally arises in 
applications of Vaughan’s identity, all of the points above address issues that can arise 
in more general situations in number theory. 
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It is technically helpful to express rj as the (multiplicative) convolution of two func¬ 
tions of compact support - preferrably the same function: 


ri{x) = rji *M Vi 





(5.2) 


For the smoothing function 77 (f) = 772 (f) = 4max(log 2 — | log 2f|, 0), equation (5.2 1 
holds with 771 = 2 • l[i/ 2 ,i], where l[i/ 2 .i] is the characteristic function of the interval 
[1 /2,1]. We will work with 77 = 772 , yet most of our work will be valid for any 77 of the 
form 77 = 771 * 771 . 

By (|5.2[), the sum (|5.1[) equals 


/ ^ 

'0 m>U 


— l V 

lU 


( \ 

E r 7\ A / \ 7 \ / \ ('nin/x\ dt 

M(a) 2 ^ A{n)e{amn)r]i{t)r]i I —— ly 

j (n,?;) —1 

/ 


d>U 


= 4/ E 

max(5fjy,C/)<m<.fr 
(m,D) = l 


IV 


\ 

E E 


d>U 
^ d\m 


ma.^(y,^-)<n<W 


A[nje{amn) 


(5.3) 


by the substitution f = {m/x)W. (We can assume V < W < x/U because otherwise 
one of the sums in ( |5.4| l is empty.) As we can see, the sums within the integral are now 
unsmoothed. This will not be truly harmful, and to some extent it will be convenient, 
in that ready-to-use large-sieve estimates in the literature have been optimized more 
carefully for unsmoothed sums than for smooth sums. The fact that the sums start at 
x/2W and W/2 rather than at 1 will also be slightly helpful. 

(This is presumably why the weight 772 was introduced in 0Taol4L which also uses 
the large sieve. As we will later see, the weight 772 - or anything like it - will simply 
not do on the major arcs, which are much more sensitive to the choice of weights. On 
the minor arcs, however, 772 is convenient, and this is why we use it here. For type I 
sums - as should be clear from our work so far, which was stated for general weights 
- any function whose second derivative exists almost everywhere and lies in £1 would 
do just as well. The option of having no smoothing whatsoever - as in Vinogradov’s 
work, or as in most textbook accounts - would not be quite as good for type I sums, 
and would lead to a routine but inconvenient splitting of sums into short intervals in 
place of ( |5.3[ ).) 

We now do what is generally the first thing in type II treatments: we use Cauchy- 
Schwarz. A minor note, however, that may help avoid confusion: the treatments fa¬ 
miliar to some readers (e.g., the dispersion method, not followed here) start with the 
special case of Cauchy-Schwarz that is most common in number theory 


E 

n<N 


n<N 


2 
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whereas here we apply the general rule 




^tEi 


/E 




to the integrand in ( |5.3| l. At any rate, we will have reduced the estimation of a sum 
to the estimation of two simpler sums lomP^ Thm of these two 

simpler sums will be of a kind that we will lead to a loss of a factor of log x (or 
(loga;)^) if not estimated carefully. Since we cannot afford to lose a single factor of 
log X, we will have to deploy and develop techniques to eliminate these factors of log x. 
The procedure followed will be quite different for the two sums; a variety of techniques 
will be needed. 


We separate n prime and n non-prime in the integrand of (5.31, and, as we were 
saying, we apply Cauchy-Schwarz. We obtain that the expression within the integral in 
(5.3 1 is at most W) ■ S 2 {U, V, W) + W) ■ 53(14^), where 


Si{U,W) 


S2{U,V,W) 


E 


( V 

E 

d>U 

\ tZ|?71 / 


E 

{m,v) — l 


E 


(log p)e{amp) 


^{V,'^)<p<W 

ip,v)-l 


(5.4) 


and 


2 


SsiW) = 


E 

2 W w 

{7n,v) — l 


E 

n<W 

n non-prime 


= (l.42620kk^/2)^ 

2 W w 
{m,v) — l 


< 1.0171x + 2.0341iy 


(5.5) 


(by IIRS62I Thm. 13]). We will assume V < w; thus the condition {p, v) = 1 will be 
fulfilled automatically and can be removed. 

The contribution of S' 3 (Vk) will be negligible. We must bound 51(17,417) and 
82 ( 11 , V,W) from above. 
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5.1 The sum cancellation 

We shall bound 


Si{U,W) 


( V 


E 

rasTKiU ,x l2W)<rn<x /W 
{m,v) = l 


E 

d>U 

\ d\m J 


(5.6) 


There will be a surprising amount of cancellation; the expression within the sum 
will be bounded by a constant on average - a constant less than 1, and usually less than 
1/2, in fact. In other words, the inner sum in (5.61 is exactly 0 most of the time. 

Recall that we need explicit constants throughout, and that this essentially con¬ 
strains us to elementary means. (We will at one point use Dirichlet series and (/(s) for 
s real and greater than 1.) 


5.1.1 Reduction to a sum with /r 

It is tempting to start by applying Mobius inversion to change d > U to d < U in 
( |531 ), but this just makes matters worse. We could also try changing variables so that 
m/d (which is smaller than x/UW) becomes the variable instead of d, but this leads 
to complications for m non-square-free. Instead, we write 


E 

m.ax{U,x/2W)<7n<x/W 


( V 

E 

d>U 

\ d\m j 


E E n{di > U)iJi{d2 > U) 

2 W ^ ^1,(^2 1^71 


= E E E 


ri<x/WU r 2 <x/WU I 

(ri.r2) = l {l,rir2) = l 

{rir 2 ,v)=l ril,r 2 l>U 

ie,v) = l 


_^_<' 777 <r ^ 

2W w 

rir 2 l\'Hi 
{m,v) — l 

(5.7) 


where di = ril, d 2 = r 2 l, I = {di,d 2 ). (The inequality ri < x/WU comes from 
rir 2 l\m, m < x/W, r 2 l > t/; r2 < x/WU is proven in the same way.) Now (5.71 
equals 


E E E M(n)Ai(r2) ^ 


®<Trt7 

(s,ii) = l (r-i,r 2 ) = l 

{rir 2 ,v) = l 


min(ri , 7 - 2 ) ’ 27.17.2s 7 ^ — T.1T.2S 

{l,rir 2 ) = l,Ml)P = l 
(e,v)=i 


1 , 


(5.8) 


where we have set s = m/{rir 2 l). We begin by simplifying the innermost triple sum. 
This we do in the following Lemma; it is not a trivial task, and carrying it out efficiently 
actually takes an idea. 
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Lemma 5.1.1. Let z,y > 0. Then 


E 


ri<v r 2 <y 
{ri,r2) = l 
{rir2,v) = l 


niinf — . ^ ,75-^—'j <l< —^— 

\ min(T'i ,T’ 2 ) ’ 2 t’jT ’2 / —^1^2 

(J,t’i»'2 ) = 1 ,(m(0 )^ = 1 

(e,v)=i 


equals 


Sz V y{ri)fj.{r2) / (^ '[1 '[1 

^i'v) ^y C^(ri)a(r2) V V2’ y ’ y 


(ri,r 2 ) = l 
(rir 2 ,v) = l 


+ O* 5.08 C 


yVz ■n (^ 


p\v 


Vp 


1 - 


,3/2 


Ifv = 2, the error tenn in ( |5.7Q| ) can be replaced by 

O.(l.27C0)\vi.(l+^) (l-ji 


23 / 2 ) 


Proof. By Mobius inversion, ( |5.9| l equals 

E E 

/>min ^ 


ri<y r 2 <y 
(ri,r 2 ) — l 
{rir2,v) — l 


l<^- 

— r’ir2 
s/y z 

niin(ri ,r 2 ) ’ 2 rir 2 


(5.9) 


(5.10) 


(5.11) 


(5.12) 


Y T{di)p{d 2 ) 

di\ri,d2\r2 
^ did 2 \l 

E/"('^3) E p.{m). 

d^lv m^\l 

d^ll {m,rir2v) — l 


We can change the order of summation of and di by defining Si = Tijdi, and we can 
also use the obvious fact that the number of integers in an interval (o, b] divisible by d 
is (b — a)/d + 0*(1). Thus ( |5.12| i equals 


Y p{di)p.{d 2 ) 

E ^ 

di,d2<y 

si<y/di 

(di,d2) — l 

S2<y/d2 

{did2,v) — l 

(diSi,d2S2) = l 


(siS2.'u) = l 

Yp(^3^ y 

p,(m) 
di d^diim- 


S2‘^3 


1 — max 


1 Sidi 32^(2 \ 

2’ir’ir; 


{m,diSid2S2v) — l 


(5.13) 
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plus 


O* 


E E E E 

di,d2<y si<y/di dslv m< / , ■ 

V 


{did2,v) — l S2<y/d2 
{siS 2 ,v) = l 


m sq-free 


(5.14) 


/ 


If we complete the innermost sum in ( 5.13| ) by removing the condition 

m < y z/{d\sd\s 2 ), 

we obtain (reintroducing the variables = diSi) 


ri,r 2 <y 
(rir 2 ,v) = l 


rir 2 


EE E 

dipi dslv 

d 2 \r 2 


o ’ ’ 

^ y y 


(5.15) 


did 2 d^m? 


{m^r\r 2 v) — l 


times z. Now ( |5.15| l equals 

y{ri)fi{r2)z 


E 

ri,r 2 <y 
(ri,r 2 ) — l 
(rir2 ,v) = l 


rir 2 


6 z V 


. 1 r"! ro 
1 - max — 

^ y y 


p\rir 2 


n n 1 


pft) 


{ri,r 2 ) — l 
{rir 2 ^v) — l 


o ’ ’ 

^ y y 


i.e., the main term in (5.10i. It remains to estimate the terms used to complete the 
sum; their total is, by definition, given exactly by ( |5.13| l with the inequality m < 
\/zj (df S(i2®2<^3) changed to to > yjz/{d\sd'^a 2 d'i)- This is a total of size at most 


^ E E E E 

di,d 2 <y Bi<y/di dslv m> / ^ j 
{did 2 ,v) = l S 2 <y/d 2 ^ 

(siS2,'l') = l 


1 


- did2d-ira^ SidiS2d2 


(5.16) 


771 sq-free 

Adding this to (|5.14|l, we obtain, as our total error term. 


E E E/ 

di,d2<y s\<yldi dz\v 
{did2,v) = l S2<yld2 
(siS2,l^) = l 


C?fsi(i2S2d3 ) ’ 


(5.17) 
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where 


fix) 


E 1 + 

m<x 
m sq-free 



m>x 
m sq-free 


It is easy to see that f{x)/x has a local maximum exactly when a; is a square-free 
(positive) integer. We can hence check that 


/(*) < i 




X = 1.26981... a; 


for all a: > 0 by checking all integers smaller than a constant, using {m : m sq-free} C 
{m : 4 } m} and 1.5 • (3/4) < 1.26981 to bound / from below for x larger than a 
constant. Therefore, (|5.17|i is at most 


1-27 E E E 

di,d2<y si<y/di d3\v 

{did2,v) — l S2<yjd2 

{SIS2,V) — 1 


dfsid|s2(i3 

/ 




p\v 


\ 


E E 


1 


dUs 

d<y s<y/d 
\{d,v) = l (s,t)) = l / 


We can bound the double sum simply by 


E E 

d<y s<y/d 
{d,v) — l 






Alternatively, if u = 2, we bound 




s<y/d s<.y/d 

{s,v) — l s odd 


and thus 


A^n, V A^ni ' / \ / 


d<y s<y/d 
(d,v) = l (s,'!)) = ! 


d<y 
{d,2) = l 


□ 


Applying Lemma 5.1.1 with y = S/s and z = x/Ws, where S = x/WU, we 
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obtain that (|5.8|l equals 


6 a; V 1 ^(ri)/z(r2) / f I n r 2 \ 

TT^Waiv) ^ s ^ ^ a(riMr2)'v 

s<b ri<S/sr2<S/s 
(s,«) = l (ri,r 2 ) = l 

(riT’2,'u) = l 



with 5.04 replaced by 1.27 if u = 2. The main term in 


6 a; V 1 

tt'^W a(v) s , 


1/2 


E E 

(ri,r2) = l 
(rir2,'u) = l 


( |5.18| l can be written as 
cr(ri)a(r2) 


(5.18) 

(5.19) 


As we can see, the use of an integral eliminates the unpleasant factor 

(‘-■"“(lA'A))- 

From now on, we will focus on the cases v = 1 and u = 2 for simplicity. (Higher 
values of v do not seem to be really prohtable in the last analysis.) 


5.1.2 Explicit bounds for a sum with /r 


We must estimate the expression within parentheses in ( |5.19 1. It is not too hard to 
show that it tends to 0; the first part of the proof of Lemma 5.1.2 will reduce this to the 


fact that fi{n)/n = 0. Obtaining good bounds is a more delicate matter. For our 
purposes, we will need the expression to converge to 0 at least as fast as 1/(log)^, with 
a good constant in front. For this task, the bound (2.211 on Yln<x enough. 


Lemma 5.1.2. Let 


9v{x) := ^ 




mAi)fA 2) 


^ cr(ri)cr(r2)’ 

rx<x r2<x V / \ / 

(T’l,'r2) = l 
(rir2,'u) = l 


where v = 1 or v = 2. Then 


[ lA 

|5iA)I < { ^(111.536+ 55.768 log a;) 

0.0044325 I 0.1079 

f (loga:)2 ^ 


if33<x< 10®, 
i/io® < X < 10A 
ifx > lOA 


[ 2.1/x 

| 52 A)| < < i(1634.34 +817.168logx) 

0.038128 I 0.2046 
1. (logx)^ “I" ^ ■ 


if33<x< 10®, 
if 10® < X < lO^o, 
ifx > IQio. 
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Tbe proof involves what may be called a version of Rankin’s trick, using Dirichlet 
series and the behavior of (^(s) near s = 1. 


Proof. We prove the statements for x < 10® by a direct computation, using interval 
arithmetic. (In fact, in that range, one gets 2.0895071/a; instead of 2.1/a;.) Assume 
from now on that x > 10®. 

Clearly 


Moreover, 


and 


ri<xr2<x \(i|(ri,r2) 
{r\r 2 -,v) — l 

= i: 1‘W Y. Y 


p{ri)n{r2) 

CT(ri)cr(r 2 ) 

/i(ri)/a(r 2 ) 


d<x 

ri<x r2"^x 

(d,v) — l 

<l\(ri,r2) 


(rir2 


E 

d<x 

Kd) 

E 

uiKxj 

{a{d)Y 

(d,v) — l 

(■ 

Up .,dv)-- 



( 

sr 

T{d) 1 

V 


/a(ui)p(M 2 ) 

a{ui)a{u 2 ) 


d<x 
(d,v) — l 


{aidW 


\ 

Tjr) 

^ a{r) 

<x/d ' ' 
\(r.dv) — l j 


^ a(r) ^ r ^^J-J-Vp+l 

r<xld r<xjd d'\r p\d' ^ 

{r,dv) — l {r,dv) — l 


- 1 


E n 


-1 I y- mW 

, 0 + 1 I r 

d'<xjd J r<xld 

t,(d'P = l 
{d' ,dv) — l 


1 


E 


{r,dv) — l 
d'\r 

fi{r) 


E d'aid') ^ r 

d'Kxjd r<xjdd' 

fi(d^)^ — l (r,dd'v) — l 

(d' ,dv) — l 


jt j-i 

r<xjdd' d''<xjdd' r<xjdd'd" 

(r,dd'v) — l d"\{dd'v)°° 


(5.20) 
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Hence 


\9{x)\ < 

d<x 


{a{d)f 


( 

E 

d'<xjd 
t,(d'p=i 
\{d' ,dv) — l 


^ ’ d"<xldd' 

d'^\{dd'v)°° 


(5.21) 


where f{t) 


Er<iM(^)A 


We intend to bound the function /(<) by a linear combination of terms of the form 
6 G [0,1/2). Thus it makes sense now to estimate T\,(si, S 2 , x), defined to be the 
quantity 


{p{d)Y 

{d,v) — l 


( 

E 

d\ 


E 

^2 


p{d',? 

d'i<j{d'i) 


d' 2 (x{d' 2 ) 


E 

d'.^ I {dd’.^v)°° 


E 


1 


1 


\ 

) 

\ 

{dd'^d'if-^^ 


for si,S 2 € [1/2,1]. This is equal to 

T{dY TT_1_ 

. (1 +p-‘f(1 -p-)n* (1-P-) 

(rf,v) —1 

d-jd'Y TT _ \ _ 

d p'\d' 

\{d'4v)^l / 

y^ d-jd'Y TT _ \ _ 

"E ((i')S2 + l 11 (1 _|_p'-l) (1 _p'-S2) ’ 

d' p'\d' 

\{d',dv) — l / 


which in turn can easily be seen to equal 


n 


1 + 


p-^lp-^2 


(1 — +P~^){1 — +p-l) J J- J- (1 — p-®l)(l — p~^^) 


n 


p\i 


■n(i+ 

p\i 


p ^p 


(1+p-i)(i 


n(i+ 

pfj 


p Ip S2 


(1 +p-l)(l -p-«2) 


(5.22) 
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Now, for any 0 < x < y < < 1, 

{l+x-y){l-xy){l-xy^)-{l+x){l-y){l-x^) = {x-y){y'^-x){xy-x-l)x < 0, 


and so 


1 + 


xy 


{I T X - y){l - xy){l - xy"^) 


< 


(l-x3) 


(l + x)(l-y) {I + x){l - y){l - xy){l - xy'^) {I - xy){l - xy'^)’ 

(5.23) 

For any x < yi,y 2 < 1 with yf < a;, < x. 


1 + 


yiy2 


< 


(1 — a;^)^(l — x'^) 


(1 - t/i + a;)(l -y 2 +x) (1 - yiy 2 )(l - yij/|)(l - y? 2 / 2 )' 


(5.24) 


This can be checked as follows; multiplying by the denominators and changing vari¬ 
ables to X, s = yi + j/2 r = yiy2, we obtain an inequality where the left side, 
quadratic on s with positive leading coefficient, must be less than or equal to the right 
side, which is linear on s. The left side minus the right side can be maximal for given 
X, r only when s is maximal or minimal. This happens when yi = y 2 or when either 
yi = \ fx or yi = X for at least one of i = 1, 2. In each of these cases, we have re¬ 
duced (5.24 1 to an inequality in two variables that can be proven automaticalljj^by a 
quantifier-elimination program; the author has used QEPCAD IHBllI to do this. 

Hence Fy{si, S 2 , x) is at most 


n 


(i-p 


n 


1 


(1 — _ p-2si-S2)(X _ p-Sl-2s2\ 11 n _ )(1 — p-S2\ 

p\v p I V 


n 


i-p 


-3 


Si- 1'1 n 


1 - p 


-3 


(1 -f P“®1“1)(1 -|-p“2si-l) 11 (1 _|_p-S2-lUl _|_p-2s2-ll 

p\v p\v 




C(si -I- 1)C(S2 + l)C(2si + 1)C(2s2 + 1) 


.,S1.S2 + S2)C(S1 + 2s2))-i ’ 

where Cv^si,s 2 equals 1 if i; = 1, and 

(1 - + 2-"i-i)(l -f 2-2 «i-1)(1 -p 2-*2-1)(1 -f 2-2«2-1) 

(1 - 2-«i+'^2)-i(i _ 2-2 si-«2)-i(i _ 2 -«i)(l - 2-«2)(1 - 2-3)4(l - 2-4) 

ifu = 2. 

For 1 <t < X, ( |2.21| l and p.24| l imply 


(5.25) 


f{t)< 



ifcc < 10^° 

log log 10^^ 

Ugx (f) ifx>10^°, 


(5.26) 


^ In practice, the case pi = y/x leads to a polynomial of high degree, and quantifier elimination increases 
sharply in complexity as the degree increases; a stronger inequality of lower degree (with (1 — 3ir^) instead 
of (1 — — x‘^)) was given to QEPCAD to prove in this case. 
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where we are using the fact that log x is convex-down. Note that, again by convexity, 

log log x - log log 10 ^° , 1 

log a: — log 10^0 i-iogio « loglO^^ 


= 0.0434294 ... 


Obviously, ^/2/i in (5.26 1 can be replaced by for any e > 0. 

By ( |5%] ) and ( |5.261 l, 

/ 2 \ 

\9 v{x)\<i-\ Fy(l/2Pt,l/2 + e,x) 


for X < 10^°. We set e = 1/ log x and obtain from (5.25 i that 


Fv{I/2, + e, 1/2 -I- e, a;) < 


C(l + 2e)C(3/2)4C(2)= 


C(3)4C(4) 


< 55.768 


1 + 


log a; 


(5.27) 


where we use the easy bound ((s) < 1 -I- l/(s — 1 ) obtained by 


Fdt. 


^ 1 

(For sharper bounds, see IIBR02II .1 Now 

/ (1 - 2-3/2-^)2(l + 2-3/2)2(1 + 2-2)2(1 - 2 -^-^^) 

- ■ ( 1 _ 2 - 1 / 2 ) 2 ( 1 _ 2 - 3 ) 4 ( 1 _ 2 - 4 ) ' 

< 14.652983, 

whereas = 1. (We are assuming x > 10®, and so e < l/(log 10®).) Hence 


l5«(a^)l < ) I 


i(111.536-f 55.7681oga;) ifu = l. 


(1634.34-f 817.168 log a;) ifu = 2. 


10 


for 10 ® < a; < 10 

For general x, we must use the second bound in (5.26 1 . Define c = l/(log 10^®). 
We see that, if a: > 10^®, 


\9v{x)\ < 


0.03^ 


, 7^1 (1 ^5 1 c) * —c,l —c 


(loga;) 2 ' 

+ 2 • ^^F(l - c, 1 / 2 ) ■ 

y/X log X ’ ’ / 

-f -(111.536 -f 55.768 log a;) • i+,. 


For V = 1, this gives 


I. 1 (.)I < + 4#^ + 4111.536 + 55.768log.) 


(loga :)2 y^loga; x 
0.0044325 0.1079 

<- n —I- 
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for z) = 2 , we obtain 


, , ,, 0.038128 25.607 

\g2[x)\ < — -^ - 

(loga:)^ ^xlogx 

0.038128 0.2046 

“ (loga ;)2 yCc 


-(1634.34+ 817.168 log x) 

X 


□ 


5.1.3 Estimating the triple sum 

We will now be able to bound the triple sum in ( |5.19| l, viz., 

- / gv{uS/s)du, 
s<S ''•^1/2 
(s,v) —1 


(5.28) 


where is as in Lemma [5.1.2| 

As we will soon see. Lemma [5. 1.2 [ that ( |5.28 1 is bounded by a constant (essentially 
because the integral l/t:(log<)^ converges). We must give as good a constant as 
we can, since it will affect the largest term in the hnal result. 


Clearly gv{R) = <?«( L^J )■ The contribution of each gy{m), 1 < to < S', to (5.28 1 
is exactly g^ (to) times 

1 1 f{m+l)s/S 

E E ' 

(s,d) = 1 (s,t;) = l 

^ p{m-\-l)s/S 

^ Jl/2 


S S 

2m m + 1 


s/S 


E 


2(m + l) 2m 

(s,L?) —1 

1 
S 


E 


+ 


E 


5 <5< g 
2m m+1 

— 1 


2(m + l) 277 

— 1 


du= 

{s,v) — l 

m + 1 1 


Idzt 


1 TO 
s “ 


(5.29) 


Write/(f) = l/SforS/2TO <t< S/{m + l), f{t) = Oforf > S/moit < S/2(to+ 
1)> f{t) = 1/^ ~ m/s for S/(to + 1) < f < S/m and /(f) = (to + 1)/S — l/2f for 
S/2{m + 1) < t < S/2m\ then (5.29 1 equals X]n (n v)=i /(’^)' Euler-Maclaurin 
(second order), 

'^fin)=[ f(.x)-^B 2 {{x})f'{x)dx=f f{x) + 0* [^\f'{x)\\dx 

„ J —oo J —oo \ / 


I "’(ly 


3 

2 m 


f 


TO + 1 


= ^ log (1 + — 
2 V TO 


O* 


2 to^ ^ ^to + 1 


(5.30) 
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Similarly, 


n odd 


1 

2 

1 

2 

1 

4 


L ^ ({^}) 

/ OO -I ^oo 

f{x)dx+- O* {\f"{x)\)dx 

-OO ^ j —OO 



We use these expressions for m < Cq, where Co > 33 is a constant to be computed 
later; they will give us the main term. For m > Cq, we use the bounds on | 5 (m)| that 
Lemma [5.1.2| gives us. 

(Starting now and for the rest of the paper, we will focus on the cases u = 1, 
V = 2 when giving explicit computational estimates. All of our procedures would 
allow higher values of v as well, but, as will become clear much later, the gains from 
higher values of v are offset by losses and complications elsewhere.) 

Let us estimate (|5.28|l. Let 


C,o 


1/6 ift; = l, 
1/3 ift; = 2. 



if u = 1, 
if u = 2, 


Cv,2 = 


55.768.. . 

817.168.. . 


if u = 1, 
if u = 2, 


C,3 


111.536.. . ift; = l, 

1634.34.. . ifu = 2. 


^vA — 


0.0044325... ift; = l. 


^0.038128... 
Then ( 5.28| l equals 


if u = 2, 


Cy 5 — 


0.1079... ifv = l, 
0.2046... ifv = 2. 





(loguS'/s)2 



du, 
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which is 


9 v{rn) ■ log ( 1 + — ) + \g{m)\ ■ O* ( , 


m<Co 


m<Co 


5m^ + 2m + 1 

) 


+ O' + 'll? (<=.,3 + ‘^■Ai + log 10')) + 


+ 0 * 


E 


0 , 4/2 


, s<S/10io 


s(log S/2sY 


for ^ > (Co + 1). Note that Es<s/ioio .(iogS/ 2 .)ii = fo 
Now 


^v,4 


f2/10^ 


1 ^ C„,4/2 


0.00009923... ifr; = l 


2 Jo t(logty log(10io/2) 10.000853636... ifr; = 2. 


and 


(Cv,3 + C„,2(l + log 10®)) + 


106 

For Co = 10000, 


^v,5 — 


0.0006506... ifr; = l 
0.009525... ifr; = 2. 


</>(v) 1 


V 2 


- ^ g„(m) . log 1 + - = 


m<Co 


0.362482... ifr; = l, 
0.360576... ift; = 2, 


V- , / 2 [6204066.5... ift; = l, 

c« 0 / \gv{m)\U>m + 2m + 1) < { 

v,u ^ lyvy ^- 1 15911340.1... ift; = 2, 

m<Co K 


and 


2<Co 

c„,i • (Iog2)/Co = 
Thus, for S > 100000, 


0.00006931... ift; = l, 
0.00017328... ift; = 2. 


1 

E - / gviuS/s)du < 
s<S 
(s,?;) —1 


0.36393 

0.37273 


if n = 1, 
if z; = 2. 


(5.31) 


For S' < 100000, we proceed as above, but using the exact expression ( |5.29| l instead 
of (5.30). Note (5.29i is of the form /s,m.i(S) + /s,m, 2 (S)/S, where both /s,m.i(S) 
and fs,m, 2 {S) depend only on [SJ (and on s and m). Summing over to < S, we obtain 
a bound of the form 


1 /■! 

'E - / gv{uS/s)du < Gv{S) 

(s,?;) —1 
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with 

G,iS)=K^A\S\)+K^,2{\S\)/S, 


where i (n) and can be computed explicitly for each integer n. (For exam¬ 

ple, Gy{S) = 1 — l/5'forl<S'<2 and G„(S') = 0 for S' < 1.) 

It is easy to check numerically that this implies that ([STST) holds not just for S > 
100000 but also for 40 < S < 100000 (if u = 1) or 16 < S < 100000 (if v = 
2). Using the fact that G«(S) is non-negative, we can compare Gv{S)dS/S with 
log(T + 1 /N) for each T G [2,40] n {N a large integer) to show, again numerically, 
that 


G.(S)^ < 


0.3698 log T 
0.37273 log T 


if u = 1, 
ifu = 2. 


(5.32) 


(We use N = 100000 for u = 1; already = 10 gives us the answer above for 
V = 2. Indeed, computations suggest the better bound 0.358 instead of 0.37273; we 
are committed to using 0.37273 because of ( |5.31[ ).) 

Multiplying by 6v/Tr‘^a{v), we conclude that 


/ 3/2 \ 

S^{U.W) O- (5.08((3/2)»^) (5.33) 


if u = 1, 

/ 3/2 \ 

S^^U,W) = ^ ■ (^) + O- (L27C(3/2)»^j (5.34) 

if u = 2, where 

"j,Gi(S) ifl<S<40, 


i7i(S) = 


0.22125 ifS>40, 


H2{s) = 


rG2(S) ifl<S<16, 


0.15107 ifS>16. 


Hence (by (5.32t) 


moreover, 


dS j 0.22482 log T ifu = l, 
1 “ |o.l51071ogr ifu = 2; 

Hi{S) < 4 , H^iS) < 4 


(5.35) 


(5.36) 


(5.37) 


for all S. 


* * * 

Note. There is another way to obtain cancellation on /i, applicable when {x/W) > 
Uq (as is unfortunately never the case in our main application). For this alternative 
to be taken, one must either apply Cauchy-Schwarz on n rather than m (resulting in 
exponential sums over m) or lump together all m near each other and in the same 
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congruence class modulo q before applying Cauchy-Schwarz on m (one can indeed do 
this if S is small). We could then write 

TTL'^W d\m m^W d\m d<U 

m=r mod m=r mod 

and obtain cancellation on d. If Uq> {x/W), however, the error term dominates. 


5.2 The sum 82 ’. the large sieve, primes and tails 


We must now bound 


2 


S2{U',W',W) = E E (logp)e(Q;mp) 

U'<m<^ W'<p<W 
{m,v) = l 


(5.38) 


for U’ = max(C/, a:/2lV), W = max(T/, lV/2). (The condition {p,v) = 1 will be 
fulfilled automatically by the assumption V > v.) 

From a modem perspective, this is clearly a case for a large sieve. It is also clear that 
we ought to try to apply a large sieve for sequences of prime support. What is subtler 
here is how to do things well for very large q (i.e., xjq small). This is in some sense a 
dual problem to that of q small, but it poses additional complications; for example, it is 
not obvious how to take advantage of prime support for very large q. 

As in type I, we avoid this entire issue by forbidding q large and then taking advan¬ 
tage of the error term 5/x \n the approximation a = | + f ■ This is one of the main 
innovations here. Note this alternative method will allow us to take advantage of prime 
support. 

A key situation to study is that of frequencies Ui clustering around given rationals 
a/q while nevertheless keeping at a certain small distance from each other. 


Lemma 5.2.1. Let q> I. Let ai,a 2 , ■ ■ ■ ,otk € K/Z of the form ai = Oi/q + Vi, 
0 < Gi < q, where the elements G K all lie in an interval of length r; > 0, and where 
Gi = Gj implies \vi — Vj\ > ly > 0. Assume v + v < 1/q. Then, for any W,W' > 1, 
W' > W/2, 


E 

2=1 


E! (logp)e(a,p) 

W'<p<W 


< min 1 


2q 1 

log((q(iy + v))-^) 


(W-W' + iy-^) i^ogpf. 

W'<p<W 


(5.39) 


Proof For any distinct i, j, the angles aj are separated by at least v (if = Gj) or 
at least l/q—\vi—Vj \ > 1/q — v > v {if Gi f Gj). Hence we can apply the large sieve 
(in the optimal N + 5~^ — 1 form due to Selberg IISel91l and Montgomery-Vaughan 


IIMV74I ') and obtain the bound in (5.39 1 with 1 instead of min(l,...) immediately. 
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We can also apply Montgomery’s inequality ( 0Mon681 . 0Hux72l : see the exposi¬ 


tions in IIMon? 1 1 pp. 27-29] and OIK04I §7.4]). This gives us that the left side of (5.39 1 
is at most 


-1 


E 

r<R 
\{r,q) = l 


{Tir)y 

0(r) 


/ 


E E E 

r<R a' mod r i—1 
(r,q) = l {a',r) = l 


E (log7')e((ai + a'/r)p) 
W'<p<W 


(5.40) 


If we add all possible fractions of the form a'/r, r < R, {r,q) = 1, to the fractions 
tti/q, we obtain fractions that are separated by at least 1/qR^. If v -\- v > 1/qR?, then 
the resulting a ngles ai + a'/r are still separated by at least v. Thus we can apply the 
large sieve to (5.40l; setting R = Ij ^/{vS^v)q, we see that we gain a factor of 




r<R 


^(r) q m q 2q 

since J2d<R log(i?) for all R > 1 (integer or not). 


(5.41) 

□ 


Let us first give a bound on sums of the type of S 2 {U,V,W) using prime sup¬ 
port but not the error terms (or Lemma 5.2.1 1 . This is something that can be done 
very well using tools available in the literature. (Not all of these tools seem to be 


known as widely as they should be.) Bounds (5.42 1 and ( |5.44| i are completely standard 
large-sieve bounds. To obtain the gain of a factor of log in ( 5.43| l, we use a lemma 
of Montgomery’s, for whose modern proof (containing an improvement by Huxley) 
we refer to the standard source IIIK04I Lemma 7.15]. The purpose of Montgomery’s 
lemma is precisely to gain a factor of log in applications of the large sieve to sequences 
supported on the primes. To use the lemma efficiently, we apply Montgomery and 
Vaughan’s large sieve with weights IIMV73I (1.6)], rather than more common forms of 
the large sieve. (The idea - used in 0MV73I to prove an improved version of the Brun- 
Titchmarsh inequality - is that Farey fractions (rationals with bounded denominator) 
are not equidistributed; this fact can be exploited if a large sieve with weights is used.) 


Lemma 5.2.2. Let W > 1, W' > W/2. Let a = a/q + 0*(l/qQ), q < Q. Then 


E 

AQ<.m<Ai 


{\ogp)e{amp) 

W'<p<W 

Ai — Aq 


< 


min(g, [(5/2]) 


(5.42) 


{W -W + 2q) E (logp)2. 

W'<p<W 
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If q < W/2 and Q > 3.5M^, the following bound also holds: 


E 

Aq <m< Ai 


(log p)e (amp) 

W'<p<W 
Ai — Aq 

q 


(5.43) 


< 


q W 
f(q) log(W/2q) 


W'<P<W 


If A, - ylo < gq and q < pQ, g, p € [0,1], the following bound also holds: 


E 

Aq <m< Ai 


(logp)e(amp) 

W'<p<W 

< {W-W'+ q/{l-gp)) (logp)^. 

W’<p<W 


(5.44) 


Proof Let A: = inin(5, [Q/21 ) > |"g/2]. We split (Ao,^i] into |"(Ai—Ao)/fc] blocks 
of at most k consecutive integers mo + 1, mo + 2,.... For m, m' in such a block, am 
and am' are separated by a distance of at least 

\{{a/q){m - m')}\ - 0*{k/qQ) = 1/q - 0*{l/2q) > l/2q. 

By the large sieve 


E 


E (logp)e(a(mo + a)p) 
W'<p<W 


<iiW-W') + 2q) (logp)^. (5.45) 

W><p<W 


We obtain (5.42 1 by summing over all [[Ai — Aif)/k'\ blocks. 


If Ai — Aq < \gq\ and q < pQ, g, p € [0,1], we obtain (5.44i simply by applying 
the large sieve without splitting the interval Aq < m < Ai. 

Let us now prove ( 5.43| l. We will use Montgomery’s inequality, followed by Mont¬ 
gomery and Vaughan’s large sieve with weights. An angle a/q + a'ljri is separated 
from other angles a'/q + a' 2 /r 2 {ri,r 2 < R, {ai, ri) = 1) by at least XjqrxR, rather 
than just I/qR?. We will choose R so that qR^ < Q; this implies 1/Q < 1/qR? < 
l/qriR. 

By a lemma of Montgomery’s IIIK04I Lemma 7.15], applied (for each 1 < a < g) 
to S'(a) = o„e(an) with a„ = log(n)e(a(mo + a)n) if n is prime and a„ = 0 
otherwise. 


f{r) 


E (logp)e(a(mo -f a)p) 

W'<p<W 


^ E 


a' mod r 
{a' ,r) — l 


W'<p<W 


a (mo + o) + — ) P 


(5.46) 
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for each square-free r < W. We multiply both sides of (5.46 1 by 


W f I 1 


2 \ qrR Q 


- 1 ' 


and sum over all a = 0,1,..., g — 1 and all square-free r < R coprime to g; we will 
later make sure that R < W'. We obtain that 


E 

r<R 
(r,g) = l 


W Z f I 1 
~2^2 ~ Q 




(5.47) 


E 


(logp)e(a(mo -f a)p) 

W'<p<W 


is at most 


E 

r<R 

r sq-free 


w z f l 1 


2 \qrR Q 


(5.48) 


E E 

a—1 a' mod r 
{a' ,r) — l 


W'<p<W 


i (mo -I- a) H- ] p 


We now apply the large sieve with weights 0MV73I (1.6)], recalling that each angle 
a(mo + a) + a'/rm separated from the others by at least 1 /qrR — 1 /Q\ we obtain that 


(5.48 


is at most J2w'<p<w(^^&PE- remains to esti 


estimate the sum in the first line of 


(5.4 / 1 . (We are following here a procedure analogous to that used in 0MV73I to prove 


the Brun-Titchmarsh theorem.) 

Assume first that q < W/lZ.b. Set 


R=[a 




1/2 


q 


(5.49) 


where ct = l/2e2°-25068 ^ 0.30285 .... It is clear that qR^ <Q,q<W' &MR> 2. 
Moreover, for r < R, 


1 

— < 


1 


< 


cr 1 ^ (t/3.5 


Hence 

W 

T 


Q - 3.5kk - Z.5 aW Z.5 qR^ “ qrR ' 

3^1 1 vE ^ ^ ^ 


2 \qrR Q 




3(j 




3r 


W 

2 ' 2 1-a/3.5 “ 2 ' 2(l- r'^'^T 


1 — a/Z.5 R 


< 


^ (l + 


R 


-1 
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and so 


V ^ 2 \qrR QJ j (j){r) 

{r,q) = l 



E 

r<R 




4>{r) 


ir,q) = l 


> 


2 (p{q) 
W q 


r<R 


Tirf 
^(r) ' 


For E>2, 


+ > logi? +0.25068; 

r<R 


this is true for R > 100 by IIMV731 Lemma 8] and easily verifiable numerically for 
2 < R < 100. (It suffices to verify this for R integer with r < R instead of r < i?, as 
that is the worst case.) 

Now 

1 / IL \ 1 W 

log-R = 2 (log ^ + log2CTj = 2 ^ ~ 0.25068. 


Hence 




and the statement follows. 

Now consider the case q > lF/13.5. If q is even, then, in this range, inequality 


(5.42 1 is always better than (5.431, and so we are done. Assume, then, that VF/13.5 < 
q < W/2 and q is odd. We set R = 2; clearly qR^ <W < Q and q < W/2 < W', 


and so this choice of R is valid. It remains to check that 

1 1 

H- 


IV I 3 
2^2 


(i-i) 


-1 


[h-h] 


W I 3 / _L_ 

2 2 I ^ 


1 , w 


This follows because 


+ 3(t_ J_\- 
^ 2 [2 3.5/ 


2 ^ 2 [4 3.5/ 


3T > log - 


for all 2 < f < 13.5. 


□ 


We need a version of Lemma [5.2.2| with m restricted to the odd numbers, since we 
plan to set the parameter v equal to 2. 


















98 


CHAPTER 5. TYPE II SUMS 


Lemma 5.2.3. Let W > 1, W > Wjl. Let 2a = ajq + 0*(\lqQ), q < Q. Then 


E 


Ao<m<Ai 
m odd 


{\ogp)e{amp) 

W'<p<W 


(5.50) 


< 


A-i — Aq 


(W-W' + 2q) E (logp)^ 

W'<p<W 


miii(2g, Q) 

If q < W/2 and Q > 3.5VL, the following bound also holds: 


E 


Aq <m< Ai 
m odd 


{logp)e(amp) 

W'<p<W 


(5.51) 


< 


Ai — ^0 
2 ? 


q 


w 


fiq) \og(W/2q) 




W'<p<W 

IfAi - ^ < 2gq and q < pQ, Q, p & [0,1], the following bound also holds: 


E 

Aq <m< Ai 


E (logp)e(amp) 

W'<p<W 

<{W -W + q/{l- QP)) (logp)^ 

W'<p<W 


(5.52) 


Proof We follow the proof of Lemma [5. 2. 2[ noting the differences. Let 

k = min(g, \Q/2'\) > \q/2'\, 

just as before. We split {A^, Ai] into [{Ai — Ao)//c] blocks of at most 2k consecutive 
integers; any such block contains at most k odd numbers. For odd m, m' in such a 
block, am and am' are separated by a distance of 


\{a(m — m')}\ = 


2a- 


= \{{a/q)k}\-0*{k/qQ)>l/2q. 


We obtain (5.501 and (5.52 1 just as we obtained (5.42 1 and (5.44 1 before. To obtain 
( 5.51| l, proceed again as before, noting that the angles we are working with can be 
labelled as a{mQ + 2a), 0 < a < q. □ 

The idea now (for large S) is that, if S is not negligible, then, as m increases and 
am loops around the circle K/Z, am roughly repeats itself every q steps - but with a 
slight displacement. This displacement gives rise to a conbguration to which Lemma 


5.2.1 is applicable. The effect is that we can apply the large sieve once instead of many 


times, thus leading to a gain of a large factor (essentially, the number of times the large 
sieve would have been used). This is how we obtain the factor of |<5| in the denominator 
of the main term x/\5\q in (5.56 1 and (5.57 1 . 
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Proposition 5.2.4. Let x > W > 1, W > W/2, U' > xj2W. Let Q > 3.5W. Let 
2a = ajq + Sjx, {a,q) = 1, \5/x\ < IjqQ, q < Q- Let S 2 {U',W',W) be as in 
( |5.5iSP with V = 2. 

For q < pQ, where p G [0,1], 


S 2 {U',W',W) < ( max(l,2p) 

Ifq < WI2, 

S2(U',W\W) < 


X X 


\ W 
8q ' 2wJ 


2q 


W'<p<W 

(5.53) 


W 


{4(p(q)log(W/2q) ' (/>(q) log(W/2qj} 


f X 


(logp)^ 

W'<p<W 


IfW > xjAq, the following bound also holds: 

, , fW r, 

S2{U',W',W)< — + 


2 1 - x/4Wq 

If 5 f 0 and xjAW + q < x/\d\q. 


W'<p<W 


(5.54) 


(5.55) 


S 2 iU',W',W) < min 1, 


2q/f(^q) 


log(^(g+4^) 

+ E (loEpf. 

' '' W'<p<W 


(5.56) 


Lastly, if S f 0 and q < pQ, where p G [0,1). 
x 

\H 


S2{U',W',W)< + ^ + 


8(1-p)Q A(l-p)W 


W'<p<W 

(5.57) 


The trivial bound would be in the order of 


S 2 {U\W\W) = {x/2\ogx) Y (logP)' 

W'<p<W 


In practice, (5.55 1 gets applied when W > x/q. 

Proof Let us first prove statements (5.54 1 and ( |5.53p , which do not involve 5. Assume 
first q < W/2. Then, by (5.511 with Aq = U', Ai = x/W, 


S2{U/W',W) < 


c/W-U' 


+ 1 


W 


f(q) log(W/2q) 


Y (logp)^- 


Clearly (x/W - U')W < (x/2W) ■ W = x/2. Thus (5.54 1 holds. 
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Assume now that q < pQ. Apply (5.50i with Aq = U', Ai = xjW. Then 
xjW -U' 


S 2 {U', W', W)<{ +]\(W-W' + 2q) Y. (logp)'- 

Vg-mm(2,p 1) ) 


Now 


xjW -U' 




< 


q min(2, p 

- . + max(l, 2p)^ +W/2 + 2q. 

qmm[2,p 21V 


This implies ( |5.53| l. 

If > x/4q, apply (5.44i with g = x/'IWq, p = 1. This yields ( |5.55| l. 

Assume now that S ^ 0 and x/4W + q < x/\5q\. Let Q' = x/\5q\. For any mi, 
TO 2 with xl2W < mi,m 2 < x/W, we have |mi — m 2 | < xl2W < 2{Q' — q), and 
so 


mi — m2 


Sjx P qSjx 


<Q'\5\/x = -^. 


(5.58) 


The conditions of Lemma 5.2.1 are thus fulhlled with v = {x/AW) ■ \5\/x and v = 
|(5g|/a:. We obtain that S 2 (U', W', W) is at most 

2q 1 


min 1 


(t>{q) \og{{q{v + v))- 
-1 


^ {w-w' + ^-1) ^ (logp)2. 


W'<v<W 

Here W — W' + =W— W' + x/\qS\ < W/2 + x/\q6\ and 

-1 


{q{iy + v))-^ = q 


.1^1 


) (^ 


4IF/ 


Lastly, assume S ^ 0 and q < pQ. We let Q' = x/\5q\ > Q again, and we split 
the range U' < m < x/W into intervals of length 2{Q' — q), so that (5.58 1 still holds 
within each interval. We apply Lemma 5.2.1 with v = [ff — q) ■ \5\/x and v = \5q\/x. 
We obtain that S 2 {U', W', fF) is at most 

' W'<p<W 


Here W — W' + v ^ < IF/2 + x/q\5\ as before. Moreover, 


IF X \ 


1 + 


x/W-U 

2(Q' - q) 


x/2W 

2(1-P)Q' 


IF X X 

<- \- O I -1- 

- 2 ^ ^ ^ 8(1 - p)Q' 4IF(1 - p) 

X W X X 

- ^ + Y + 8(1 - p)Q + 4(1 - p)IF ■ 


Hence ( |5.57| i holds. 


□ 





























Chapter 6 

Minor-arc totals 


It is now time to make all of our estimates fully explicit, choose our parameters, put 
our type I and type II estimates together and give our final minor-arc estimates. 

Let cc > 0 be given. Starting in section 6.3.1 we will assume that x > Xq = 


2.16 • 10^°. We will choose our main parameters U and V gradually, as the need arises; 
we assume from the start that 2 • 10® < V < x/A and UV < x. 

We are also given an angle a £ M/Z. We choose an approximation 2q; = a/q + 
S/x, {a,q) = 1, q < Q, \5/x\ < AjqQ- The parameter Q will be chosen later; we 
assume from the start that Q > max(16, 2y/x) and Q > max(2[/, xjU). 

(Actually, U and V will be chosen in different ways depending on the size of q. 
Actually, even Q will depend on the size of g; this may seem circular, but what actually 
happens is the following; we will first set a value for Q depending only on x, and if 
the corresponding value of g < Q is larger than a certain parameter y depending on x, 
then we reset U, V and Q, and obtain a new value of q.) 

Let Si,i, Si^ 2 , Sn, Sq be as in (3.91, with the smoothing function p = 772 as in 
( |3.4| l. (We bounded the type I sums Si,i, Si ^2 for a general smoothing function 77 ; it is 
only here that we are specifying 77 .) 

The term Sq is 0 because V < x/A and 772 is supported on [—1/4,1]. We set v = 2. 


6.1 The smoothing function 

For the smoothing function 772 in p.4[ ), 

\V 2 \i = l, |77^|i = 81og2, |?7 "|i=48, (6.1) 

as per 0Taol41 (5.9)-(5.13)]. Similarly, for 772 ,p(f) = log(pf)r 72 (f), where p> A, 

\m,p\i < iog(p)|?? 2 |i = log(p) 

1^2, pit = 2772 ,p(1/2) = 21og(p/2)772(l/2) < (81og2)logp, 

I 'l^'p 1 1 = 4 log (p/4) + 12 log p - 4 log(p/4) I + 14 log 2 - 4 log p I 
-I- I log p - 4 log 2| + I log p| < 48 log p. 
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In the first inequality, we are using the fact that \og{pt) is always positive (and less than 
log(/o)) when t is in the support of 772 . 

Write log’*' x for max(log x, 0). 


6.2 Contributions of different types 

6.2.1 Type I terms: S'/ I. 


The term Sj^i can be handled directly by Lemma 4.2.3 with po = 4 and D = U. 


(Condition (4.38 1 is valid thanks to (6.2i.) Since U < Q/2, the contribution of S '/1 
gets bounded by (4.40[i and (AA\\: the absolute value of S/p is at most 


— mm 1 , -— 
q \'{2^Y) 

y log 

m mq 

m<^ 

+ *|log-7(-^)l 
q 

/t(m) 

m 

m<^ 




(m.g) = l 


2-^/ CqCi 
TT 

3ci j2x 


fc/ log = + log ’ + ? log ’ log+ ®^ log ’ logH 

\ U C 2 2 C 2 q J 2q C 2 


U 

C2X 

g 


2x 




01/2^ 

~ir 


( 1 ,log logx + log 


AT:\ri'\ix 


Stt^ 


C2 


(6.3) 

where cq = 31.521 (by Lemma|BX3]l, ci = 1.0000028 > 1 + (81og2)/C > 1 + 
(81og2)/(ir/(7) andc 2 = 677 / 5 ^^ = 0.67147 .... By (|2^ (with fc = 2), ( |BT7| ) and 
Lemma |B.2.4| 

|log-r7(-(5)| < min ^2 - log 4, 


By (2.201, (2.221 and (2.231, the first line of (6.3 1 is at most 

. f 4 g/S(<?) 






^ c'o' ^ .( A q/(j}{q) \ 

where = 0.798437 > co/(27r)2, c'^ = 1.685532. Clearly Sp'/co > 1 > 2 - log4. 

Taking derivatives, we see that t 1 —)■ (t/2) log{t/C 2 ) \og^ 2U/1 takes its maxi¬ 
mum (for t S [1,2(7]) when log(f/c 2 ) log''" 2(7/f = logf/c 2 — log^2(7/f; since 
t log f/c 2 — log’*' 2U/t is increasing on [1, 2(7], we conclude that 
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Similarly, t ^ t \og(x/t) \og^{U/t) takes its maximum at a point t S [0, ?7 for which 
\og{x/t)\og^ {U/t) = log(a::/f) F\og^ {U /t), and so 

a; , q , + U C/ ,, , rrx 

- log — log^ — (log X + log U). 

q C 2 — C 2 


We conclude that 


\Sis \ < - min ( 1, ^ 


. f 4g/0(q) 

iin - inT- 

\5log+^ 


+ C 7 J log — + csj log X max 1, log 


C2 


(log 




Jj J *^ 4,7 


X 


4>{q) 


9 + Cio./;;,— log^:^ 
iqx U 


, f 1 2t/ , , tt\ TT , m 2yex cio,/ 

+ C 5 J log-h C6,/ \ 0 gxU U + CQJ^Jx\ 0 g -1- 

V C2 J C2 e 


(6.4) 


where C 2 and Cq are as above, j = 2.11104 > Cq/cq, C 4 j = 1.00303, C 5 / = 
3.57422 > 2Y^coCi/7r, cqj = 2.23389 > cyj = 6.19072 > 2^/icQCl|^T, 

C 8 ,/= 3.53017 > 2(8log 2 )/ 7 r, 


cgj = 19.1568 > 


3-\/2ci 


37r2 


cio,/ = 9.37301 > co(l/2 — 2 / 7 r^) and cnj = 9.0857 > I(Fk ■ 8 log 2). 


6.2.2 Type I terms: 5/ 2 . 

The case q < Q/V. If g < Q/V, then, for v <V, 


2va = -h O 


— +n* ( — ] = ^ 


\Qq 




and so va/q is a valid approximation to 2va. (Here we are using v to label an integer 


variable bounded above by u < 14; we no longer need v to label the quantity in (3. 101 , 
since that has been set equal to the constant 2.) Moreover, for Qy = Q/v, we see that 
2va = {va/q) + 0*{l/qQy). If a = a/q + 5/x, then va = va/q + 5/{x/v). Now 


Si ,2 = E''(») E /r(m) E e{{va) ■ mn)r]{mn/{x/v)). (6.5) 

v<V m<U n 

V odd m odd o4d 

We can thus estimate Si ,2 by applying Lemma |4.2.2| to each inner double sum in ( |6.5[ ). 
We obtain that, if |5| < l/ 2 c 2 , where C 2 = 67 r/ 5 y^ and cq = 31.521, then |S' 7 _ 2 | is 
at most 


Ea(^) 

v<V 


( 

V 



Ejm) 

m 

m<Mylq 
{m,2q) — l 



Ciojq 

Ax/v 





( 6 . 6 ) 
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plus 


» / \ f 2-v/coc+ 3c+ x _^_U v^coc+ U 

> A u - U + — -log+ — + log+ — 

\ TT 2 vq,^ TT 

i;<V \ VQv 


qv/‘2 


, ST Af f} cii, 7 g 2 \ /2V3coc+ 3c+ 55coC2\ \ 

+ Hv) (^c„ m„ ^log —, Ij + 2^7 + j j ■ 

(6.7) 

where = q/{q,v). My G [min{Q/2v,U),U] and c+ = 1 + {8\og2)/{x/UV); if 
|(5| > 1 / 2 c 2 , then |S'/, 2 | is at most (6.6i plus 


V^^TT , I o , + ^) 1 + 2C7 ^ a; ^ 35 coC2 

e J£A Q 37r2 

\S\q. / ^ 






v<V 


TT I2 




+ 1, 2U I I ^3 + 2e + 


log 


+ 2U 


L^J+1 


Write Sv = J2v<v Mv)l{vqy). By (2.12 1 , 


( 6 . 8 ) 


Sy < _|_ A(u) / (g, v) _ 1 

^ ~ vq ^ V \ q q 

v<V v<V 

(v,q)>l 


log V 1 ,, , 

< -+ -^(logp 

n 


q q 


p\q 


3(7) + X/ 


V 


1 


- V 

P“ P' 


1 


< = 


q q 


p\q 


a>l ^ a>l 

pQ + «p(g)<y p°‘<V j 

log Vq 

q 


(6.9) 


This helps us to estimate ( 6 . 61 . We could also use this to estimate the second term in 


the first line of ( |6.7[ ), but, for that purpose, it will actually be wiser to use the simpler 
bound 

A(u)-log+ ^ A(u)^— < - UV (6.10) 

n,n. e eC 2 


v<V 




(by (2.14i and the fact that flog'*’ A/t takes its maximum at f = A/e). 
We bound the sum over m in (|6.6|l by \2.2Q\ and (|2.22|i: 


E 


p(m) 


ra<My ! q 
(m,2q) = l 


< min 


4 q/c/jq) 
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To bound the terms involving ({7/q„ + l)^,we use 

Hv)v < 0.50041/2 (by 

v<V 

Y,K{v)v{v,qy < Y, K{v)v + V E A(v)(v,gy’ 


v<V 


v<V 


v<V 






v<V 


p\q 


l<Q<log V 


p\g 


< (logF) E P^pU) < glogl/ 

p\q 


and 


„Vp(q)+a 


E A(u)(t;,g)2 < ^(logp) Y P" 

v<V p\q y 

0,q)^l 

< ^(logp) • < 2gl/logg. 

p\q 

Using p.l4|i and (|6.9|l as well, we conclude that (|6.6|l is at most 


^ • I 1 

— mm 1 


2g 


Co 


{irsy 


■ ^ 

mm - 




Cio,/ 


U 


+ ^ 0.50041/2(7 ( - + 1 ) + 2UVg log U + 2(721/ log U | . 


Assume Q < 2UVje. Using (2.14 1 , (6.10 1 , (2.18 i and the inequality vg < Vg < Q 
(which implies g/2 <U /e), we see that (6.71 is at most 

' / 2ycoc+ 3c+ 


1.0004 


V TT 2eC2 


UV + ^Q,o,y- 

TT g/2 


+ ( C 5,72 max ( log *^“77 ^ ^2 J + cqj^ ) 


where = 3.53312 > 1.0004 • csj and 

/ 2i/3coC+ 3c+ 55 coC2\ 


C 6./2 = 1.0004 


V TT 2c2 OtT^ / 


( 6 . 11 ) 


The expressions in (6.8 1 get estimated similarly. The hrst line of (6.8 1 is at most 

1.0004 (h^uv + ^ (2 + i±i bg+ ?//&) A + 

\7r 2\ € X J Q Stt^ J 
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by (2.14i. Since q < QjV, we can obviously bound qV by Q. As for the second line 

ofdelT- 




v<V 




_\S\qv_ 


+ 1,2U] •-log+ 


2U 


\S\q. 


+ 1 


, sU 1.0004^^^, 

< } A(u) max flog — < > Aiu)— =- UV, 

t>o t e e 

v<V v<V 


but 


^A(u)i 






+ l,2t/)< Hv)-2U 


V<. 


— 2U\8\q 


V A(u)^ + yA(u)+ y A(u) 

I'? ^ \<lv 9 


x/\5\ ( \ 1 


-<v<V 


{v,q)^l 


v<V 


v<V 


< 1 - 03883 ^ + fi , ('“8 aj* + ‘“8 " 

X 1 


+ 1.0004^ + — - y (logp)up(g) 
|(5| q ^ 


p\q 


< y f 1.03883 + logg + log+ + 1.0004^ 

\S\q V V2a; J 


by (2.12i, (2.13 1 , (2.14i and (2.15|l; we are proceeding much as in (|6.9[). 


Let us collect our bounds. If |<5| < l/ 2 c 2 , then, assuming Q < 2UV/e, we con¬ 
clude that [S'/ 21 is at most 


2 ^”“ ('• (Jj^) (lyr-' 1 '“ 8 r'! 


C 8./2 


a; fUVY 


X ) 


Yh) 


iVq 

y CIO./ (UV 


U'^V 

q log V H-log V 

X X 


( 6 . 12 ) 


plus 


{c4j2+Cgj^)UV + {cio,i 2 log —-I-C 5./2 max 

q 



ciijq'^V 



+012,72)-Q, 


(6.13) 


X 
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where 


C 4 J 2 = 3.57565(1 + eo) > 1-0004 • 2.^coC_|_/7r, 

C5,/2 = 3.53312 > 1.0004- C8,/, 

C8,/2 = 1-17257 > ^ - 0.5004, 

eg,/, = 0.82214(1 + 2eo) > 3c+ - 1.0004/2ec2, 

C10./2 = 1-78783V1 + 2eo > 

C 12./2 = 29.3333 + 11.902eo 

> 1.0004 + 1-78783(1 + eg) log2 

= C6,/2 + Cio. 72 log 2 


and cio,/ = 9.37301 as before. Here eg = (41og2)/(a;/C/y), and cgjj 1^ in (6.11 1 . 
If |<5| > 1 / 2 c 2 , then is at most (6.12 1 plus 


. 1 + e, +2UV\6\q\ X 

+ C 16./2 2 +-log+-' ' 


where 


(c 4,/2 + (1 + e)ci3,/Jt/y + Q ^Ci4j2 ^logg + log+ ^+ O 15 , 

) Q /l^ 017 , 72 !? F Cg - 04 ^/ 214 , 

1 ^ 1-0004 

013.72 = 1-31541(1 + eg) > - - -, 

TT e 

C14.72 = 3.57422V1 + 2eg > 

TT 

ci 5 72 = 3.7130 VI + 2eg > . 1.03883, 

’ TT 

= 1.5006(1 + 2eo) > 1.0004 - 3c+/2 
35cgC2 


X 


(6.14) 


Ci 7 = 25.0295 > 1.0004 ■ 


37r^ 


and Cg = (1 + e)-\/3 + 2e. We recall that C 2 = 67 r/ 5 y+g = 0.67147.... We will 
choose e S ( 0 , 1 ) later; we also leave the task of bounding eg for later. 


The case q > Q/V. We use Lemma 4.2.4 in this case. 


6.2.3 Type II terms. 

As we showed in (5.1 1 - (153J, Sii (given in ( |5.1| l) is at most 

, - dW , - dW 

4 VSi{U,W)-S2 {U,V,W) — +4 ^Si{U,W)-S^{W) — , (6.15) 


where Si, S 2 and are as in ( |5.4| l and ( |5.5| l. We bounded Si in ( |5.33| ) and ( |5.34| l, S 2 
in Prop. |5.2.4| and S 3 in ( |5.5[ ). 



































108 


CHAPTER 6. MINOR-ARC TOTALS 


Let us try to give some structure to the bookkeeping we must now inevitably do. 
The second integral in (6.15 i will be negligible (because S 3 is); let us focus on the first 
integral. 

Thanks to our work in j ]5.1| the term Si{U,W) is bounded by a (small) constant 
times x/W. (This represents a gain of several factors of log with respect to the trivial 
bound.) We bounded 52 ((7, V, W) using the large sieve; we expected, and got, abound 
that is better than trivial by a factor of size roughly ^ y/q log x - the exact factor in 
the bound depends on the value of W. In particular, it is only in the central part of the 
range for W that we will really be able to save a factor of ^ y/q log x, as opposed to 
just ^ y/q. We will have to be slightly clever in order to get a good total bound in the 
end. 


We first recall our estimate for Si. In the whole range \V,x/U] for W, we know 
from (5.33 1 , (5.34i and (5.37 1 that Si{U, W) is at most 


where 




kq — 1.27. 


x/WU 


U 


(6.16) 


(We recall we are working with v = 2.) 

We have better estimates for the constant in front in some parts of the range; in 
what is usually the main part, ( |5.34| i and (5.36i give us a constant of 0.15107 instead 
of 2/7r^. Note that 1.27C(3/2)^ = 22.6417.... We should choose U, V so that the 
first term in (|6.16[) dominates. For the while being, assume only 


L > 5 • 10® 


vu' 


then (6.16 1 gives 


where 


Si{U,W) < Ki 


W’ 


(6.17) 

(6.18) 


22.6418 


TT^ 


< 0.2347. 


^lW2 

This will suffice for our cruder estimates. 

The second integral in ( |6.15 1 is now easy to bound. By ( |5.5| l, 

53 (IL) < 1.0171X + 2.0341IL < 1.0172a:, 

since W < xjU < x/h ■ 10®. Hence 


IV 


, - dW 

v/5i(C/,tL)-53(IL) —<4^ 

< Kg 


n/U 


V 

X 

w 


X dW 
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where 


Kg 


= 8 • Vl-0172 • Ki < 3.9086. 


Let us now examine Sg, which was bounded in Prop. 5.2.4 We set the parameters 
W, U' as follows, in accordance with (|5.4|l: 


W' = max(V, W/2), U' = max(t7, x/2W). 
Since W > VL/2 and W >V > 117, we can always bound 

(logp)^ < ^lL(logVL). 


(6.19) 


W'<p<W 

by (|09]l. 

Bounding Sgfor 6 arbitrary. We set 

Wo = min(max(26*g, L), x/L), 

where 0 > e is a parameter that will be set later. 

For V < W < Wo, we use the bound ( |5.53| i: 

S 2 {U',W',W)< (maxil,2p) (^ + ^) + 2q ] ■ ^-WilogW) 


1 


8 q 2W 


W 1 


< max (-,p) ( — + -)a;loglL + 


W^ logVF 


qW log W, 


where p = q/Q. 

If Wq > V, the contribution of the terms with V <W < Wo to ( |6.15| l is (by |6.18| l 
bounded by 


rWo 


' Ki - 


X f po fW 


VF V 4 \iq 


+ 1 ) a; log W + 


W2 log W 


^K 2 ^ V^ogW K 2 VlogW^ 

-h737^"+T^X. 


K 2 \ 


IPqx^ 

I6q 


IV 

^/WW 




qW log W 


dW 


dW 

W 


+ qx 


W 


dW 


( 6 . 20 ) 


< + Kg'/xWo^ y/logWo 

^^+ 9 ^ ((logW^o)"/' - (logL)"/") , 


2k 2 I PoX 
where po = max(l, 2 p) and 


K2 = 4i/ki < 1.93768. 

(We are using the easy bound \/a + h + c < y/a + Vb + yfc.) 
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We now examine the terms with W > Wq. If 29q > x/U, then Wq = x/U, the 
contribution of the case is nil, and the computations below can be ignored. Thus, we 
can assume that 29q < x/U. 

We use ( |5.54| l: 


S 2 {U/W/W) < 


w 


4(/>(q)log(W/2q) ' (/>(q) log(W/2q)} 2 


WlogW. 


By y/a+l) < \fa + >/&, we can take out the q/4>{q) ■ W/ \og{W/2q) term and estimate 
its contribution on its own; it is at most 


x/U 




lvp2. dW 


'Wo 


W c/)(q) 2 logW/2q W 

K 2 I q I xlogW 

V Jwo V ^ W/2q 

f-x/U ‘ ~ 


dW 


^ jqx^ j-'~ _J_ / / log2g \ 

“ \/2 V Jwo VW y V log W/2q j 


dW 


( 6 . 21 ) 


Now 


x/U 


1 


log2g 


■dW = 


’./lUq 


1 


Jwo VW^^OgW/2q Jxaayi(S,V/2q) \/t\ogt 

We bound this last integral somewhat crudely: for T > e, 


dt. 


1 / T 

dt < 2.3 


Vt log t 


logT’ 


( 6 . 22 ) 


(This is shown as follows: since 


< 2.3, 


T 


VTlogT ' \"^\l logT^ 
if and only if T > Tq, where Tq = ^ = 2135.94 ..., it is enough to check 


(numerically) that (6.22i holds for T = Tq.) Since 9 > e, this gives us that 

px/U / 


log 2 q \ 


I Wo VW 
< 2 


Y logH^/2g I 
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and so \6.2\\ is at most 


/ log2g \ X 

m \ ■ pogx/2Uq) Vu' 


We are left with what will usually be the main term, viz., 

fx/U I 


4/ 


>Wo 


W- 


8 i/)(g) logW/2gJ W’ 


(6.23) 


which, by (5.34i, is at most x/^(j){g) times the integral of 


W 




2II, 


(—) 

\wuJ 


k4 x/WU\ logM^ 


U y logVF/2(7 


for W going from Wq to x/U, where H 2 is as in (5.35 1 and 

K4 = 4koC(3/2)^ < 90.5671. 

By the arithmetic/geometric mean inequality, the integrand is at most 1/W times 

13 +13~^ ■ 2H2 {x/WU) I3~^ Ki Ix/WU P log2g 


2 2 


U 2 1ogM^/2g 


(6.24) 


for any /? > 0. We will choose /3 later. 

The first summand in ( 6.24| l gives what we can think of as the main or worst term 
in the whole paper; let us compute it first. The integral is 


P + p-^- 2H2 {x/WU) dW p + p-U 2H2 {s) ds 


'Wo 


W 


2 

X 


- [2 4p) ^°^ UWo 


(6.25) 


by (5.36 1 , where 


K6 = 0.60428. 


Thus the main term is simply 


P Ke 


X . X 

log- 


,2 4P) ^UW^' 

The integral of the second summand is at most 


(6.26) 


1 K4 dW 1 Ki Ix/UV 

^ '^irjy -Yy-[r- 
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By (6.17 1 , this is at most 


^.10-3 


where 


K7 = 


K4 < l3 ^kt/2, 

'/ 2 k 4 
1000 


< 0.1281. 


Thus the contribution of the second summand is at most 

/3~^Kr X 

2 \/^M 


The integral of the third summand in (6.24 1 is 


/3 log2g dW 

2 7„.„ \ 0 gW/ 2 qW 


(6.27) 


If 17 < 20q < x/U, this is 


P \og2qdW _ B 

‘ 2 J 20 q \ 0 gWl 2 qW 

= ^ log 2q ■ 


rx/2Uq ^ 

9 log f t 

X 

log log - log log 6 » 


If 29q > x/U, the integral is over an empty range and its contribution is hence 0. 
If 29q < 17, ( |07l ) is 

13 log2g dW /3 log 29 ^ 


2 Jy \ogW/ 2 q W 


2 

13 log 2 q 
2 

(3 log 2 g 
2 


V/2q 


logf t 


(log log — - log log v/2q) 

, / loga:/1717\ 

logF/ 2 g)- 


(6.28) 


(Let us stop for a moment and ask ourselves when this will be smaller than what 
we can see as the main term, namely, the term (/3/2) Xogx/UWQ in (6.251. Clearly, 
log(l + (log a::/(717)/(log 17 / 29 )) < (log a:/(7F)/(log C/ 29 ), and that is smaller than 
(loga;/t/ 17 )/log 29 when C /29 > 2q. Of course, it does not actually matter if (6.28 1 
is smaller than the term from ( |6.25| l or not, since we are looking for upper bounds here, 
not for asymptotics.) 

The total bound for ( |6.23| ) is thus 




/?' 






-1 




2 


(6.29) 
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where 


$ = 


(log 2q (^log log ^ - log log if V/20 <q< x/{29U). 
I log 2g log (l + ‘rogy 72 g ) if g < 1^/20. 


(6.30) 


Choosing optimally, we obtain that (6.231 is at most 



(6.31) 


where is as in ( |6.30| l. 

Bounding S 2 for |(5| > 8. Let us see how much a non-zero 6 can help us. It makes 


sense to apply (5.56 1 only when |i5| > 8; otherwise (5.54 1 is almost certainly better. 


Now, by definition, \5\/x < I/qQ, and so |(5| > 8 can happen only when q < x/8Q. 


With this in mind, let us apply (5.56 1 , assuming |(5| > 8. Note first that 


X 

l<^9l 


— 1 
4WJ 


-1 


> 


l/l^9l ^ 4/|5g| 


1 — 
iW 


1 

2Q 


> 


41L 

I<5|9 


> 


w 

4W 


1 + ^-1% 1 + ^- 


This is at least 2min(2(3, W)/\5q\. Thus we are allowed to apply (5.561 when < 


2min(2(3, W). Since Q > x/U, we know that min(2Q, W) = W for all W < x/U, 
and so it is enough to assume that |5g| < 2W. We will soon be making a stronger 
assumption. 

Recalling also (|6.19|l, we see that (|5.56|l gives us 


( 


S 2 {U',W',W) < min 


1 ,- 


2 q/(j){q) 


\ 


V 


log ^ 


1% 


(s 


w 

T 


-W{\ogW). 


(6.32) 

Similarly to before, we define Wq = max{V, 0\Sq\), where 9 > 3e^/8 will be set 
later. (Here 9 > 3e^/8 is an assumption we do not yet need, but we will be using it 
soon to simplify matters slightly.) For W > Wq, we certainly have |(5(7| < 2W. Hence 
the part of the first term of (6.151 coming from the range Wq <W < x/U is 


/ - dw 

4 / y'S,iU,W)-S2iU,V,W) — 
JWn 


fxlU 

'Wo 

< 4 


4>{q) Jwo 


\ 


Si(c;,w') 


logic (Wx ^W‘^\^dW 


log(^ 


1+^ 


V l<5g| 


w 


(6.33) 
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By (5.34 1 , the contribution of the term Wx/\5q\ to (6.33 i is at most 


4x 

\/WM^ 


j.x/U 

'Wo 


i 




log VF 


dW 


4W 1 
FI? ■ 1+5^ 


W 


Note that 1 + {x/U)/2Q <3/2. Proceeding as in (6.231-(6.31 1 , we obtain that this is 
at most 

2 x 




log 


C/Wo 


+ $ Ke log 


UWr 


+ 2k7 , 


where 


$ = 


_ log aM (log log jiIIj - log log f ) If v/e < I j,| < i/w, 


(6.34) 


where ei = xjTUQ. This is what we think of as the main term. 

By (6.181, the contribution of the term W^/2 to (6.33 i is at most 


g 

</(g) 


r/^ dw 

I Wo 


/ logW 

2'^yW Wo<W<i V log ■ 


(6.35) 


Since t —>■ (logf)/(logf/c) is decreasing for t > c, (6.35 i is at most 




log 1^0 

Ing- 8 Wo ■ 

iOg 3|5q| 


(6.36) 


If Wo > V, we also have to consider the range V < W < Wq. By Prop. 5.2.4 and 
(|6.19|l, the part of (|6.15|l coming from this is 


re\sg\ 


'v 


Wx 


dW 


. . Wx W2 


The contribution of W^/4 is at most 

rWo I ~ pp2 _ 

4 y Y Ki — log W • — — < 40^1 • Va:Wo • ^lo^; 
the sum of this and ( |6.36| ) is at most 

< • J-^^y/\ogWo, 
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where we use the facts that Wq = 6*|(5(7| (by Wq > V) and 6 > 3e^/8, and where we 
recall that K 2 = 4y^. 

The terms Wx/2\6\q and lTa;/(16(l — p)Q) contribute at most 


X 


{\ogW)W 


X 





dW 

W 


f 1 1 \ /^^dW 


The term a;/8(l — p) contributes 



logfF dW 
W{l-p)l¥ 


< 


V^x r°° VWW 

yw Jv 


\/2(i - p)y 


where we use the estimate 



VWw 

Vt/3/2 


W 

1 



~ W Ji 

_^VWv 

w 


Vlog w + log F , 

— 

Vlog V 1 1 logu 

#/2 ^^VvJi 


* * * 


It is time to collect all type II terms. Let us start with the case of general 6 . We will 
set 0 > e later. If g < V129, then |S'//| is at most 


X 

\/2(/'(g) 



+ log 2 q log 



\ogxlUV 

logL/2g 



X 

UV 





1 + 1.15. 


/ log 2g \ 

\ogx/2Uqj ^/U ' \/V 


Kg- 


(6.37) 
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lfV/26<q< x/29U, then \Sii \ is almost 


+ \/2^2 


1 ^ , logx/2Uq\ ( ^ X 

vTW, + j *“«cT^ + 


. 1 + 1.15W I + (K2\/log20q + Kg)-^ 

</'(?) \ ^logx/2Uqj y/u Wv 


6 


((log26»g)3/2 - (logy)3/2) 


a; 


+ K2 (^\/ 20 . 1 og 20 g+-((log 209)"/2 - {\ogVf/^)j 

(6.38) 

where we use the fact that Q > x/U (implying that pg = max(l, 2q/Q) equals 1 for 
q < xl2U). Finally, if g > a;/20{7, 


\Sll\ < {K2\/2\ogx/U + '^ 9 );^ + K 2 \/\ogx/U ^ 

+ ?|^((logx/(7)3/2 - (logf^)3/2) . 


(6.39) 


Now let us examine the alternative bounds for |i5| > 8. Here we assume 0 > 3e^/8. 
If 1'5'zl < V/0, then |5'//| is at most 


2a; 


vwm\ uv 


K6 log — + 2^7 


, X , , |5g|(l + ei) \ogx/UV 

log 7777 + log-1-log 1 + 


log 


4 V 


l'5|(l+ei)9 , 


(6.40) 


K2i 


2q 


logV 


X X 

Kg- 


^ ^(q) Vlog2F/|<)<z| Vu "Vv’ 

where ei = xl2UQ. \IV/0 < |(5|<7 < xl0U, then \Sii\ is almost 
2x 


1 - , , 3|()(7| ^ log 3^15,1 \ ( ^ X 

,/\5\^) ^ C/ • 0|<5|g + 8 log 80/3 ) U ■ 9\Sq\ 


2/^7 


2/^2 lx X 


+ 


((log0|<)g|)3/2-(logf^)3/2) 

+ vFkTr) + ^ 


(6.41) 
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where p = q/Q. Note that |i5| < xjQq implies p < x/Q^, and so p will be very small 
and Q — q will be very close to Q. 

The case |i5g| > x/OU will not arise in practice, essentially because of |(5|g < xjQ. 


6.3 Adjusting parameters. Calculations. 


We must bound the exponential sum A{n)e{an)r]{n/x). By (3.8 1 , it is enough to 
sum the bounds we obtained in §6.2| We will now see how it will be best to set U, V 
and other parameters. 

Usually, the largest terms will be 


CoUV, 


(6.42) 


where Co equals 


C4^/2 ~t“ cg^j^ — 4.39779 -t- 5.21993co 

C 4./2 + (1 + e)ci 3 .z, = (4.89106 + 1.31541e)(l + eo) 


(from (6.13 1 and (6.14 1 , type I; we will specify e and eg 
and 


if |5| < 1/2 c 2 - 0.74463, 
if |5| > 1/2 c2 

(6.43) 

= (4log2)/(x/UV) later) 


\/<5o^(9)\ 


log — + (log(5o(l + ei)<7) log 


1 + 


log 


uv 


log 


V 

i5o(l+ei)g , 


Kg log — + 2 k 7 


(6.44) 


(from (6.37i and ( 6.40| i, type II; here Jq = max(2, |5|/4), while ei 
|(5| > 8 and ei = 0 for |5| < 8. 

We set UV = ^cxj^/qSo; we must choose >f > 0. 


x/2UQ for 


Let us first optimize (or, rather, almost optimize) x in the case |5| < 4, so that 
^0 = 2 and ei = 0. For the purpose of choosing x, we replace \/(j){q) by yfqjCi, 
where Ci = 2.3536 ^ 510510/^(510510), and also replace V by q^/c, c a constant. 
We use the approximation 


log 





log 1 + 


log(v^/>r) \ 

log(g/2c) J 


3 2log2i/c/>f 
^°®2+ 3logg/2c 



log2v/c/>tr \ 
logq/2c ) 
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What we must minimize, then, is 


Cqx Cl 


V^\ 

Cq^c Cl 


( 3 21og^' 

log^+log2g log- + 


31og^ 


K6 log 




< 






/ Atv 

' k'i log q — 2 k'i log XT H-^ log 2 

Kq 


< 


Cn 




XT + K 4 (^k[ log g - ^ ^ 


where 




1 log 4c 

31^ 


log K + 4 


2k7 


(6.45) 


/ 

Ki 


1,3 , , 3 log 4c log 2g 

J + Iogj. «j = log-/2 + log2bg- + ^j^^^, 


2 


44^7 


Ke 


+ 4 log 2) = 


log 4c (log 4c) ^ 


Cl 


Cn V 24 


Ke 


0.30915 


6 61og|; 

if \S\ < 4 


''5> 


l+1.18694eo 

_ 0-27797 _ jf I Cl ^ A 

(l+0.26894e)(l+eo) ^ ’ 


1 q 4 ^' ^ 

4 = o (log + log 2 log - H-1-^ +4 log 2) 1.01152. 

2 2 ACg 

Taking derivatives, we see that the minimum is attained when 


llog4c\ , 

31og^j''" 


1.7388 


log 4c 

Ic 


0.30915 


3 log 47 l + 1.19eo 


(6.46) 


provided that |(5| < 4. (What we obtain for |i5| > 4 is essentially the same, only with 
Sqq = Sq/A instead of 2q and 0.27797/((l + 0.27e)(l + eg)) in place of 0.30915.) For 
q = 5 ■ 10®, c = 2.5 and |^| < 4 (typical values in the most delicate range), we get that 
K should be about 0.5582/(1 + l.lOeg). Values of q, c nearby give similar values for 
XT, whether |i5| < 4 or for |5| > 4. 

(Incidentally, at this point, we could already give a back-of-the-envelope estimate 
for the last line of ( |6.45| l, i.e., our main term. It suggests that choosing w = 1 instead 
of ui = 2 would have given bounds worse by about 15 percent.) 

We make the choices 

x: = 1/2, and so UV = — y — 

^ 2VqSo 

for the sake of simplicity. (Unsurprisingly, ( |6.45| l changes very slowly around its min¬ 
imum.) Note, by the way, that this means that eg = (2 log 2) /y/qS^. 

Now we must decide how to choose U, V and Q, given our choice of UV. We will 
actually make two sets of choices. 
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First, we will use the S /,2 estimates for q < QjV to treat all a of the form a = 
a/q + 0*{l/qQ), q < y. (Here y is a parameter satisfying y < Q/V.) 

Then, the remaining a will get treated with the (coarser) S '/2 estimate for q > 
Q/V, with Q reset to a lower value (call it Q'). If a was not treated in the hrst go (so 
that it must be dealt with the coarser estimate) then a = a'/q' + 5'/x, where either 
q' > y or 5'q' > x/Q. (Otherwise, a = a'/q' + 0*{l/q'Q) would be a valid estimate 
with q' < y.) The value of Q' is set to be smaller than Q both because this is helpful 
(it diminishes error terms that would be large for large q) and because this is harmless 
(since we are no longer assuming that q < Q/V). 


6.3.1 First choice of parameters: q<y 

The largest items affected strongly by our choices at this point are 


C16,/2 2 + 


1 + e 


and 


log’ 


2UV\S\q\ X 


X J Q/V 
U 


+ C 17 J 2 Q (from Si, 2 , |i5| > 1/2 c2), 


cio ,/2 log — + 2 c 5,/2 + ci 2,/2 ) Q (from Si, 2 , I<5| < l/ 2 c 2 ), 


(6.47) 


K2\ 


2q 


m \og!/mq) vu ' '^'^Vv 


X X 

Kg—/= (from S'//, any |de(fa|), 


(6.48) 


with 


K2\ 


2q 


\ogV 


(j){q) V \og2V/\5q\ ^ 


—— (from S/i) 


as an alternative to (6.48 1 for |(5| > 8 . (In several of these expressions, we are apply¬ 


ing some minor simplihcations that our later choices will justify. Of course, even if 
these simplihcations were not justihed, we would not be getting incorrect results, only 
potentially suboptimal ones; we are trying to decide how choose certain parameters.) 
In addition, we have a relatively mild but important dependence on V in the main 


term (6.44i, even when we hold UV constant (as we do, in so far as we have already 


chosen UV). We must also respect the condition q < Q/V, the lower bound on 
U given by (|6.17|l, and the assumptions made at the beginning of the chapter (e.g. 
Q>x/U,V >2- 106 ). Recall that UV = x/2^/^. 

We set 

^ 82 /’ 

since we will then have not just q < y but also < x/Q = 8y, and so qSg < 2y. 
We want q < Q/V to be true whenever q <y\ this means that 


^ ^ Q _ QU 


QU 


V UV x/2^0 


Uy/Wr> 

Ay 
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must be true when q < y, and so it is enough to set U = Ay^jy^qSo. The following 
choices make sense: we will work with the parameters 


y = 
u = 


^1/3 

6 ’ 

4j/2 


Q=^ = 
Sy 4 

.t2/3 




V = 


x/UV = 2^/^ < 2^^, 

X X 9x^/3 


(6.49) 


{xlUV) ■ U 8?/2 


where, as before, Jq = max(2, |(5|/4). So, for instance, we obtain ei < xjTUQ = 
Q\/qSo/x^/^ < 2-\/3/a:^/®. Assuming 


X > 2.16 • 10 


20 


(6.50) 



y < x/4, UV < X, Q > max(16,2i/x), Q > max{2U,x/U), 
as stated at the beginning of the chapter. Let 0 = (3/2)3 _ 27 / 8 . Then 


(6.51) 


V xjSy^ 


> 


= 4> 1, 


2eq 29q “ 1602/3 54^3 

V ^ xjSy^ > ^ ^1 

9\6q\ ~ 86y ~ 6402/3 2162/3 


(6.52) 


The first type I bound is 


\Si,i\ < - min 1 , 


<52 


/ / 


4 q 

5 0(g) 

1 + ai3 /9 

log Vr 






/ 


(^log 9a: 3 + C 3 ,/) + 


\ 


/ 


+ (^CTj log — + C 8,7 log a;^ 2/ + — r (log 9 a;^/ 3 ^eq6o) 

V C 2 J 3422g3/2^2 


22-2/3 .^5/3 

+ ( C5 ./ log -^== + C6 ./ log 




.2/3 


^ 2-yea; cio,/ 
+ cgjVa^log-^- 


9C2y^ ’ 9y/^J 9\/q5^ ' ’ C2 

where the constants are as in ( 6.2. 1| For any c,R> 1, the function 

X — )■ (logca;)/(loga:/i?) 

attains its maximum on [R', 00 ], R' > R, at x = R'. Hence, for q6o fixed, 

415 


(6.53) 


log 


+ 4x3/3 ’ 

9((5o(j)t 


9x3 ^/q^ + C3,J 


(6.54) 
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attains its maximum for x G oo) at 

3/2 


X = 


(^9e^/^(Soq)^/y4^ = C27/S)e^/^[q5of^/\ (6.55) 


Now, notice that, for smaller values of x, ( |6.54 1 increases as x increases, since the term 
min(..., 1) equals the constant 1. Hence, (6.54i attains its maximum for x £ (0, oo) 
at (|6.55|), and so 



1 (^log9a;3 + C3,/^ + C 4 ,/ 


27 7 

< log + C 3 J + C4J < - log(5og + 6.11676. 


Examining the other terms in (6.53|l and using (6.50 1 , we conclude that 


|5/,i| < ^min(^l,^ 


^ (1 log +6.11676 




(0.67845 logX - 1.20818) + 0.37864x^/3^ 


(6.56) 


where we are using ( |6.50| l (and, of course, the trivial bound S^q > 2) to simplify the 
smaller error terms. We recall that Cq = 0.798437 > co/(27r)^. 

Let us now consider Si, 2 . The terms that appear both for |5| small and 1 large are 
given in (6.12|i. The second line in (6.12|i equals 


C 8,/2 


i — 

V 


< C 8,/2 


2UV^ qV^ 

- 1 - 

X X 


+ 


< 0.29315 


4g2,5o 2^/2 8 

X 


+ 


Cioj 1 

^ q x2/3 ' 

2 1 

^2v^ ISg^o, 

Cl0,7 


x2/3 \ 

2 

1^23/2 

l8q6o ) 


log 


9x1/3 




(0.086791ogx +0.39161) 


r"/3 


0.00153v/x, 


where we are using ( |6.50 1 to simplify. Now 

min ( ^, 1 ) log Eg = min 

W / 


4/5 \ , 9xi/3g 

-^,1 )log- 




(6.57) 


can be bounded trivially by log(9xi/3g/2) < (2/3) log x +log 3/4. We can also bound 
( |6.57| i as we bounded ( |6.54| i before, na mely, b y hxing q and hnding the maximum for 
X variable. In this way, we o btain that (6.57 1 is maximal for y = Ae^^^q^', since, by 
definition, x^/^/h = y, (6.57 1 then equals 


9f6 • 4 

log - = 3 log ^ + log 108 + - < 3 log Q + 5.48214. 

z 0 
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We conclude that (|6.12|l is at most 


min ( 1, ^ 
+ 0.29315 


log 9 + 2.74107 






X 


q'^So 


(0.04341oga; + 0.1959)a:2/^ 


(6.58) 


If |5| < 1/2 c 2, we must consider (6.131. This is at most 

^2/3 


(c 4,72 + Cg^/J 
< 


2 -\/ qSo 
2.1989a; 3.61818a: 


^ 9g3^/2y5o + ■ 4^"^" 




qSo 


+ (1.77019 log a; + 29.2955)x^/^ 


where we recall that eg = (4 log 2)/ (x/UV) = (2 log 2) /^/qSo, which can be bounded 
crudely by-\/2 log 2. (Thus, cioja — \/l +\/8 log 2-1.78783 < 3.54037 and cig,/a ^ 
29.3333 + 11.90272 log 2 < 41.0004.) 

If |5| > 1/2 c 2, we must consider (6.14i instead. For e = 0.07, that is at most 

, .. . X / 21og2\ 

(C4,,. + (l + c)c.3,,.)j^(l + ^j 

+ (3.38845 (1 


^)log5,= +20,8823)^ 
+ (68.8133 (l + Tp) log |5|4 + 72.0828) 


= 2.49157 


VqSo 


l+2^\ _^(3 38845iogj^3_^32 677i)^ 

vqoo ) loll? 

22.9378logX + 190.791 + 130.691 ) x? 

V 900 / 

< 2.49157^^ + (3.59676 log Jo + 27.3032 log g +91.2218)^ 

V 900 900 

+ (22.93781ogx + 411.228)xs, 

where, besides the crude bound cq < -72 log 2, we use the inequalities 


< 


< 


< 


log 8 

log? ^ 

1 log? ^ 1 loge^ 72 

72 ’ 

7^ ~ 

72 7 ? ~ 72 e e 

4c2 

log|J| ^ 

2 log Jo 

Jo ’ 

1^1 " 

e log 2 Jo 


(Obviously, 1/| J| < dcg/Jo is based on the assumption |(5| > l/2c2 and on the inequal¬ 
ity 16 c 2 > 1. The bound on (log |J|)/|J| is based on the fact that (logf)/f reaches its 
maximum at f = e, and (log Jo)/Jo = (log2)/2 for |J| < 8.) 
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We sum (6.58 1 and whichever one of our bounds for (6.131 and (6.141 is greater 


(namely, the latter). We obtain that, for any S, 


|5'/,2| < 2.49157 




+ min 


1,^) • Q log g +2.74107 


Hi) 


+ (3.59676 log + 27.3032 log g + 91.515)— + (22.9812 log x + 411.424)x2/^ 

Ho 

(6.59) 

where we bound one of the lower-order terms in (6.58 1 by x/q^5o < x/qSo. 


For type II, we have to consider two cases: (a) |(5| < 8 , and (b) |5| > 8 . Consider 
hrst |5| < 8 . Then i5o = 2. Recall that 0 = 27/8. We have q < V and |( 5 ( 7 | < YjB 


thanks to (6.52i. We apply (6.37 1 , and obtain that, for |5| < 8 , 


I'S'/zl < 


^/WH) \ 






'"sli 1 


■ v/0.302141og4g5o + 0.2562 
8.22088,/^^ I 1 + 1.15 




< 


\/ 2 />(g) 


• yC'^, 29 log 2 g+ • v/0.302141og2q +0.67506 


+ 16.406, /+ 1.84251X®/® 

V HH) 


(6.60) 


where we bound 


log 2 g 




log + 5 - 

< -< lim 


1 

log V 


log 


1/6 


log 




1/6 


= 2 , 


and where we dehne 


Cx,t log 1 + 


log 4f 

2 log; 

^ ^og 2.0044 , 


for 0 < f < 9x^/^/2. (We have 2.004 here instead of 2 because we want a constant 
> 2(1 + ei) in later occurences of for reasons that will soon become clear.) 

For purposes of later comparison, we remark that 16.404 < 1.57863x'^/^“^/^ for 
X > 2.16 • 102 O. 


Consider now case (b), namely, |(5| > 8 . Then 80 = 


|,5|/4. By (6.521, |,5g| < VjB. 
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Hence, (6.40 1 gives us that 




2 x 


V0.302141og|5|g +0.2562 


I, ir I , l<^9l(l + ei) , log|<5|g \ 

- log |5g| + log ^ log 1 + 


2 loe I 


+ 8.22088 


< 


<?!>('?) ^ log 


log 

l^9l 


+ 1.84251a;®/® 


\JCx,5oq log<^o(l + ei)g + ^°^^^^ A/0.302141og Jog + 0.67506 


y<5o^(g) 

+ 1.79926, O^a;^/® + 1.84251a;®/®, 

V nn 


(6.61) 


8.22088 




• (g<5o)'/" < 8.22088Wi-^^ 

log^ V logT 


^ • (x®/®/3)®/^ 


< 1.79926a;'‘/®"2/® 


fora; > 2.16 • lO^®. Clearly 


log(5o(l + ei)g = logSoq + log(l + ei) < log(5og + ei. 


By Lemma 
check that x —; 


C.2.2 


q/4>{q) < F{y) = F(a;^/®/6) (since x > 18®). It is easy to 
■JF (a;®/®/6)x^/®“®/® is decreasing for x > 2.16 • 10^® (in fact, for 


18®). Using (6.50 1 , we conclude that 1.67718-\/g/(^(g)a;"*^/® < 0.89657a;®/® and, by 
the way, 16.406-\/g/0(g)a;®/'^ < 0.78663a;®/®. This allows us to simplify the last lines 
of (6.60 1 and (6.61|l. We obtain that, for 5 arbitrary. 




Cx,Soq{^og5oq + ei) 


log 4(5og ^q_ 3 Q 214 ipg 0.67506 


\/< 5 o</'(g) 

+ 2.73908a;®/®. 

(6.62) 

It is time to sum up S'/.!, S '/2 and S//. The main terms come from the hrst line 
of (6.62 1 and the hrst term of ( |6.59| l. Lesser-order terms can be dealt with roughly: 
we bound min(l, Cq/ 5^) and min(l, Ac'q/5^) from above by 2/5o (using the fact that 
Cq = 0.798437 < 16, which implies that 8/S > 4cg/S^ for (5 > 8; of course, for S < 8, 
we have min(l, 4 cq/; 5^) < 1 = 2/2 = 2/So). 
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The terms inversely proportional to q, </)(q) or thus add up to at most 


+ (3.59676 log (5o + 27.3032 log 9 + 91.515)4- 

goo 

2x /13 \ 2x 

— log Jo? + 7.81811 I + —(13.6516 log (5 o9 + 37.5415), 
4 J doq 


SoHg) 


where, for instance, we bound (3/2) logg + 2.74107 by (3/2) log^og + 2.74107 — 
(3/2) log 2. 

As for the other terms - we use the assumption x > 2.16 • 10^° to bound 
and x"^^^ log a; by a small constant times We bound x 2/3/y^by x2/3/v^(in 
( |6.56| l). We obtain 

2/3 

(0.67845 log X - 1.20818) + 0.37864x5 

v2 

+ (22.9812logX + 411.424)x5 + 2.73908xs < 3.35531x^/®. 


The sums S'o.oo and in (3.111 are 0 (by (|6.50 1 and the fact that 772 (f) = 0 for 
f < 1/4). We conclude that, forq<y = x^/^/6, x > 2.16 • 10^° and 77 = 772 as in 

0, 


\S^{x,a)\ < |5/,i| + |57,2| + |^//| 

^ , \/^^.^oa(log<^og + 0-002) + ^°^f°'^ V0.302141og(5og + 0-67506 

V<7o</(9) ^ 

+ f ^ log(5o9 + 7.81811^ + ^(13.6516 log (5 o9 + 37.5415) 

y%q So(l)[q) V 4 J dog 

+ 3.35531X®/®, 

(6.63) 

where 

(5o =max(2,|5|/4), Cx.t = log | 1 + ) ■ (6-64) 

\ ^ 2.004i / 

Since Cx,t is an increasing function as a function of t (for x fixed and t < 9x1/3/2.004) 
and doq < 2y, we see that Cx,t < Cx, 2 y It is clear that x 1 — Cx,t (fixed t) is a 
decreasing function of x. For x = 2.16 • 10^°, Cx, 2 y = 1.39942 .... 


6.3.2 Second choice of parameters 

If, with the original choice of parameters, we obtained q > y = x 1/3/6, we now reset 
our parameters {Q, U and V). Recall that, while the value of q may now change (due to 
the change in Q), we will be able to assume that either q > y or\Sq\ > x/{x/ 8 y) = 8 y. 





















126 


CHAPTER 6. MINOR-ARC TOTALS 


We want U/{x/UV) > 5 • 10® (this is (6.17 1 ). We also want UV small. With this 
in mind, we let 


,1/3 


V = 


U = 500V6x^/®, Q=^r = 


,2/3 


U 50076 ' 


(6.65) 


Then (6.17 1 holds (as an equality). Since we are assuming (6.50 1 , we have V > 2 - 10®. 
It is easy to check that (6.501 also implies that U < yCcj2 and Q > 2yfx, and so the 
inequalities in ( |6.51| l all hold. 

Write 2a = a/g + i5/a; for the new approximation; we must have either q > y or 
I (51 > 8y/q, since otherwise a/q would already be a valid approximation under the first 
choice of parameters. Thus, either (a) q > y, or both (hi) |J| > 8 and (b2) |(5|g > 8y. 
Since now V — 2y, we have q > V/29 in case (a) and |5g| > V/d in case (b) for any 
0 > 1. We set 0 = 4. 

(Thanks to this choice of 0, we have < x/Q < xj9U, as we commented at the 
end of §6.2.3[ this will help us avoid some case-work later.) 

By ( |Ml ), 

\Si.i\ < ^min^l, ^loga;^/®-log50076 + C3j + C4j^^^ 


+ crj log -h cgjlogxlogcii,/ — 

' C2 X 


Q + ClO I — log-=—I- 

4a; 50076 e 


cs / log -1- C6 / log 50076a:^/® I • 50076a;^/® + cg i^/x log 

’ C2 ’ / ’ C2 


X 


< - min I 1, — 


liogx- 4.99944 + 1.00303^ 
3 (l){q) 


2.89 2/3/1 \2 

+ looo" ’ 


where we are bounding 

1 Q ^ 1 1 

C7.I log-h Csj log X log Ciij - 

C2 X 


=C8j0-Ogxf - ^C 8 , 7 (log 1500000 - log Ciij) - loga; + C 7 ,/log 

<C 8 , 7 (loga;)^ — 38 log a;. 


50076 ' 


C2 


We are also using the assumption ( |6.50| l repeatedly in order to show that the sum of 
all lower-order terms is less than (38c8,7 log a;)/(50076)- Note that C 8 , 7 (log a;)^Q < 
0.00289a:2/3(loga;)2. 

We have q/4>{q) < F (Q) (where F is as in (|C.19|) and, since Q > ^/6 ■ 12 • 10® 
fora; > 2.16 • 10®°, 


1.00303r((3) < 1-00303 log log Q -f 


2.50637 


log log 76 • 12 • 10° 
< 0.2359 log Q-f 0.79 < 0.1573 log a;. 
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(It is possible to give a much better estimation, but it is not worthwhile, since this will 
be a very minor term.) We have either q > y or 5|(5| > 8y; if > 8y but q < y, then 
|(5| > 8, and so c^jS'^q < l/8|(5|g < l/bdy < 1/y. Hence 

IS'/.il < — ^^^+0.1573^ loga;^ + 0.00289a;^/^(loga:)^ 

< 2.4719a;2/3 logcc + 0.00289x2/3(log a;)^ 

We bound |S'/^2| using Lemma [4.2.4| First we bound ( |4.50| i: this is at most 
X . ( 4c'o\ x^/^q 

/I 1 \ / ((714)2 log 3^ 304 500^6 (500^6x3/3+ l)2a; hog xi\ 

+ 2x 6x ’ 


where C4 = 1.03884. We bound the second line of this using ( |6.50| l. As for the first 
line, we have either q > y (and so the first line is at most (x/2y)(logx3/3 y/3)) or 
q < y and AcQlS'^q < l/16y < 1/y (and so the same bound applies). Hence (4.501 is 
at most 


3x2/3 ( - log X — log 18 


-0.02017x2/3 logx = 2.02017x2/3 logx-3(logl8)x2/3. 


Now we bound (4.511, which comes up when |5| < l/2c2, where C2 = 67r/5y7io, 
Co = 31.521 (and so C2 = 0.6714769 ...). Since l/2c2 < 8, it follows that q > y (the 
alternative q < y, (7|i5| > 8y is impossible, since it implies |(5| > 8). Then ( |4.51[ ) is at 
most 


2-^/ CqCi 


(cF log ^ + e (,/3 log ^ log 


uv\ 


3cix, UV 

+ — - log UV log-— 

2 y C 2 x/y 


16 log 2 


Qlog 


2 ‘“°Q/2; 

Coe3(g2 ^ Q 

Att ■ 8 log 2 • X 2 


( 6 . 66 ) 


5^4;iogf; + f;j(3c,)-/=v^iogx, 

where ci = 1.000189 > 1 + (81og2)/(2x/(717). 

The first l ine of (6.661 is a linear combination of terms of the form x2/3logC'x, 
67 > 1; using (6.501, we obtain that it is at most 1144.693x2/3 log x. (The main contri¬ 
bution comes from the first term.) Similarly, we can bound the first term in the second 
line by 33.0536x2/3 logx. Since log(coe3(52/(47r • 81og2 • x)) log<3/2 is at most 
logx^/3 logx2/3, the s econd term in the second line is at most 0.0006406x(log x)2. 
The third line of (6.661 can be bounded easily by 0.0122x2/3 logx. 

Hence, ( |6.66| l is at most 

1177.76x2/3 log X + 0.0006406x2/3(logx)2. 
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If 1^1 > 1 / 2 c 2 , then we know that |(5g| > mm{y/2c2,8y) 
(with e = 0 . 01 ) is at most 


j// 2 c 2 . Thus (4.52 1 


‘^^^UVlog 

TT 


uv 

7^ 


2.02y'CoCl 

TT 



^ X 

^ 2 // 2 c 2 

3.03 

(U6 


+ 1 


logx 


1 ) log 

- V^logX. 


1 e^UV 

- log UV log 

!// 2 c 2 2 


Again by (6.50|l, and in much the same way as before, this simplihes to 


< (1144.66 + 15.107 + 68.523)a;2/3 logx + 29.l-i&x^/'^(\ogxf 

< 1228.85x2/3(loga;). 


Hence, in total and for any |(5|, 


|S'/, 2 | < 2.02017a;^/^ loga; + 1228.85a;2/^(loga:) + 0.0006406a:2/^(loga;)^ 
< 1230.9x^/3 (log a;) + 0.0006406x^/3(logx)2. 


Now we must estimate Su. As we said before, either (a) q > y, or both (hi) 
|(5| > 8 and (b2) |5|q > 8y. Recall that 0 = 4. In case (a), we have q > X^/3/6 = 
14/2 > 14/20; thus, we can use (6.38 1 , and obtain that, if 9 < x/8U, IIS’//! is at most 



log 2q log - 


\ogx/{2Uq)\ 


log 4 ) 


Kg log 


U -Sq 


2 k 7 


1 + 1.15. 


' log x/AU 


+ {K 2 ^J\ 0 gx/U + 1^9)-^ 


(^(log8y)3/2 - (log22/)3/2^ 


K 2 (v/81ogx/C/+ ^((logx/[/)3/2 - (logy)3/2; 


vW 


(6.67) 

where F is as in (C.19i. (We are already simplifying the third line; the bound given 
is justihed by a derivative test.) It is easy to check that q —>■ (log 2g) (log log (?)/(; is 
decreasing for q > y (indeed for q > 9), and so the first line of ( |6.67| l is maximal for 

q = y- 

































6.3. ADJUSTING PARAMETERS. CALCUEATIONS. 


129 


We can thus bound (6.67 1 by a;®/® times 


/3r(e‘/3/6) ( ^ - log 8c + Q - log 3^ ^~lo^T~) (^^“4.214 




— log 24c 
log 4 


+ Y y - log 6c + Kgj Vs 


+ 


K2 


V4^ V 


M 


2t 


log 6c 


y-log6cj -(--log 3 


( 6 . 68 ) 

where t = logo; and c = 500/V6. Asymptotically, the largest term in (6.67 1 comes 
from the last line (of order even if the first line is larger in practice (while being 
of order at most t log t). Let us bound ( 6 . 681 by a multiple of 
First of all, notice that 


d 


y)_(.’ic,g{|-iog6) + y*yy y. 


dt log t 


logf 


t{\ogtY 


„7 _ 2.50637 


(6.69) 


log2(|-log6) 


e^ + 


2.50637 


log2(|-log6) log (I -log 6) 


(f — 3 log 6) logf 
which, for t > 100, is 


> 


t log t log t 

„7 1„„0 2.2.506371ogt o nn^on 

^ 1.95671 - » 


> 0 . 


f(logf)^ “ f(logf)2 

Similarly, for t > 2000, 

2.50637 log t _ 2.50637 

d ^2 log2(f-log48c) log(f-log48c) 


dt log t 


> 


t{iogty 


> 


0.72216 - 

logt 


t{\ogty 


> 0 . 


Thus, 


/ e*/3\ 
r — < (logf) • lim 

' ^ ' s—>-oo 


F 


6 J 

,2t/3 


48c 


F 



logs 


/ ^2s/3 \ 

F 

(«.) 


< (logf) • lim 

s->.oo log s 


= log t 
= log t 


for t > 100, 


for t > 2000. 


(6.70) 
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Also note that, since = (3/2)-y/x, 




•log4< 1.20083v^. 


for t > 2000. We also have 

(I - log6c) ' - (5 - loss) “< (I - logs) ■ - () - logs) 

= ( 2^/2 _ 1 ) Q _ log 3^ ' < ( 2^/2 - 1 )^ < 0.35189f3/2^ 


Of course, 

t ft \ \-\og 2 c f t t t 

3 - l°g«' + (3 - ‘°«=’) ‘‘”5 -Sis- ^ (3 + 3 ‘‘"S 3 ) < 3 

We conclude that, for t > 2000, ( |638l l is at most 
^3 • e'l' log t ■ ^ log ^ a/ e'^ log t ^1 + 0.79749-\/f^ 

+ vf ^ ^ + I ■ 0.35189f3/2 

< (0.10181 + 0.00012 + 0.00145 + 0.000048 + 0.00462)f3/2 < 0.10848^^/^. 


On the r emai ning interval log(2.16 • 102°) < t < lo g 2000 , we use interval arith¬ 
metic (as in (2.6 with 30 iterations) to bound the ratio of ( 6 . 681 to f^/2 \ye obtain that 
it is at most 

0.275964f3/2^ 


Hence, for all x > 2.16 • 102°, 

\Sji\ < 0.275964x®/®(logx)3/2^ 


(6.71) 


in the case y < q < x/8U. 


If x/8U < q < Q, we use (6.39 1 . In this range, x/2-y/2g + y/qx adopts its 


imum at q = Q (because xl2y'2q for q = x/8U is smaller than ^tqx for q = Q, by 
(6.65 1 and (6.50i). Hence, (6.391 is at most x®/® times 


^K2^2 Qf-logc'^ +K9^ V3-\- K2]J‘^t-\ogd ■ ^ 


^ Qi-logc')^-(^-log3 



2^2 Vd 
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where t = log x (as before) and c' = 500-\/6- This is at most 


(2k 2 xi ~z x/t A 


2k2 2^/2 _ 1 3 / 


33/2 


t2 


2^2 


< 0.10327 


for t > log (2.16 • 102°), and so 

|S'//| < 0.10327x^/®(logx)3/^ 

for x/8U < q < Q, using the assumption x > 2.16 • 102°. 

Finally, let us treat case (b), that is, |i5| > 8 and > 8y; we can also assume 
9 < y, us otherwise we are in case (a), which has already been treated. Since |<5/a:| < 
1 / qQ, we know that 


N<^ = C/ = 500v^x'/><^ = ^ = ^, 
Q 2000v^ 4t7 9U" 


again under assumption (6.50l. We apply (6.411, and obtain that \Sii\ is at most 


2xy/Fiy) 


+ 


+ 


ySy 

2^2 


, X , , \ogx/3Uy\ / , X \ 

‘°s u-Tg-y + ‘“S ‘“S-toiWT ) * U^y 7 

x/A 


^ ((log32y)i-(log2y)i) 




K2 


VQ-y 


((log4C/)2 - (log2y)2) 


\^2{1-y/Q) 
«2\/r(y) • \/log4t/ • 


(\/logF + v^l/log + Kg 


w 


x/u' 


(6.72) 


where we are using the facts that (log 3f/8) jt is increasing for t > 8y > 8e/3 and that 

d (logt)3/2 — (log 17)3/2 3(logf)l/2 — ((logt)3/2 — (log 17)3/2) 
dt i/t 2f3/2 

log J 3 • v^Iop- (log 17)3/2 

=-3573-<» 


for f > 0 • 8y = 1617, thanks to 


log 


1617 


log 1617 > (log 17)3 hog 16 - 2 log — 


16 


16 


+ I ( log 7a ) “ ^ log 16 


(logl^)' 

log 17 > (log 17)3 


2 
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(valid for log V > 1). Much as before, we can rewrite (6.721 as x®/® times 


2v/r(e‘/3/6) jt 


O + log “ log 3 



^2(1-^) V ^t/3-\og3^ 


(6.73) 

where t = logx and c = 500/-\/6. For t > 100, we use (6.70l to bound F(e‘/®/6), 
and we obtain that (|6.73|l is at most 


2v^ n ^ 


2k2 


3 3 

1/4 


2 k 2 /3 1 /f 


o ■ ^ • (logf)i+ -ir\ o ■ o 7 + log“^ -log 16 


32 \ 


1/2 


2 V3 

\ 1/2 


6 J 




glOO/S 
6c 

K2VS 


I /1 \ ' 

= • 2 ( 3 + log 24c j • log 72c 




1 - 


glOO/S 


) V 


( \A73 + 




Kg 




(6.74) 

where we ha ve bou nded expressions of the form (a > 6) by (a^/^/2)-(a—6). 

The ratio of (6.741 to clearly a d ecrea sing function of t. For t = 200, this ratio 

is 0.23747...; hence, ( 6.74 i (and thus ( 6.73 1) is at most 0.23748/®/^ for t > 200. 

On the range log(2.16 • 10^°) <t< 200, the bisection method (with 25 iterations) 
gives that the ratio of (6.73 1 to is at most 0.23511. 

We conclude that, when |^| > 8 and > 8y, 

|5//| < 0.23511x®/®(logx)®/^ 

Thus ( |6.71| i gives the worst case. 

We now take totals, and obtain 


^r,(a;,a)< |^/,i| + |5/,2| + |577| 

< (2.4719 + 1230.9)x 2/® logx + (0.00289 + 0.0006406)x2/3(logx)2 
+ 0.275964x®/®(logx)®/^ 

< 0.27598x®/®(logx)®/^ + 1233.38x2/® logx, 


(6.75) 
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where we use (6.501 yet again. 


6.4 Conclusion 


Proof of Theorem \3.1 .1\ W e have shown that |5',,(a,a;)| is at most ( 6.63 1 for q < 
and at most (6.751 for q > It remains to simplify (6.631 slightly. 

By the geometric mean/arithmetic mean inequality, 

y a, 5 „,(log Soq + 0.002) + v/0.30214 log Soq + 0.67506 (6.76) 


IS at most 


^ (^a,5„,(log,5o7 + 0.002)+^°®^^ (0.30214log<5o<? +0.67506) 

for any p > 0. We recall that 


Cx,t — log 1 + 


log At 
2 log; 

^ ^og 2.0044 , 


Let 


, = = 3.397962.... 


0.30214 log 2qo +0.67506 
where xi = 10^®, qo = 2 - 10®. (In other words, we are optimizing matters for x = xi, 


Soq = 2go; the losses in nearby ranges will be very slight.) We obtain that (6.76 1 is at 
most 


a 


f 1 




i^(log3„, +0.002) , 

1 /lo^ y[p 

2 Wp 2 

< 0.27125C^,t(log S^q + 0.002) + 0.4141 log d^q + 0.49911. 


0.67506j 


Now, for a; > xo = 2.16 • 10^*^, 


Cx 


< 


a 


Xo,t 


log t log t log t 


1 ( log At 

log 1 + 


54-106 I — 


2 log 2.0044 / 

for 8 < f < 10® (by the bisection method, with 20 iterations), and 


< 0.08659 


a 


Ca 




log t log t log t 
if 10® < t < a:^/®/6. Hence 


2 log 


2.004 


9-6 / — 


< 0.08659. 


0.27125 • Cx,Soq ■ 0.002 < 0.000047 log (5og. 


(6.77) 
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We conclude that, for q 


ic, / i?x,5o9log+ 0.49911 , 2.492 x 

1 *^( 0 ;,a;)| < - - - , , , , , -• x + 


+ 


where 


2x /13 


<5o</'(9) 


\/H<l)^o V¥o 

/13 \ 2x 

( — log Jo? + 7.82 j + - —(13.66 log Jo? + 37.55) + 3.36x^/®, 


= 0.27125 log 1 + 


log 4f 


2 log 


+ 0.41415. 


2.004t , 


□ 



Part II 

Major arcs 
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Chapter 7 

Major arcs: overview and 
results 


Our task, as in Part||] will be to estimate 

Snia,x) = E A{n)e{an)r]{n/x), (7.1) 

71 


where 77 : IR+ —C us a smooth function, A is the von Mangoldt function and e{t) = 
g27r2t j-[g]-g^ yyg yyjjj f}jg g^sg Qf Q, lying on the major arcs. 

We will see how we can obtain good estimates by using smooth functions rj based 


on the Gaussian This will involve proving new, fully explicit bounds for the 

Mellin transform of the twisted Gaussian, or, what is the same, bounds on parabolic 
cylindrical functions in certain ranges. It will also require explicit formulae that are 
general and strong enough, even for moderate values of x. 

Let a = a/q + 5/x. For us, saying that a lies on a major arc will be the same as 
saying that q and 6 are bounded; more precisely, q will be bounded by a constant r and 
|(5| will be bounded by a constant times r/q. As is customary on the major arcs, we 
will express our exponential sum (3. 11 as a linear combination of twisted sums 


E A(n)x(n.)e(^n/a;)p(n/a:), (7.2) 

n—1 


for X : Z —>■ C a Dirichlet character mod q, i.e., a multiplicative character on {'LIq'L)* 
lifted to Z. (The advantage here is that the phase term is now e{6n/x) rather than 
e{an), and e{Sn/x) varies very slowly as n grows.) Our task, then, is to estimate 
S^^^{6/x,x) for (5 small. 

Estimates on rsly on the properties of Dirichlet L-functions L(s, x) = 

En ■ What is crucial is the location of the zeroes of L(s, x) in the critical strip 

0 < 3?(s) < 1 (a region in which L{s, x) can be defined by analytic continuation). In 
contrast to most previous work, we will not use zero-free regions, which are too narrow 
for our purposes. Rather, we use a verification of the Generalized Riemann Hypothesis 
up to bounded height for all conductors q < 300000 (due to D. Platt llPlabl l. 


137 





138 


CHAPTER 7. MAJOR ARCS: OVERVIEW AND RESULTS 


A key feature of the present work is that it allows one to mimic a wide variety 
of smoothing functions by means of estimates on the Mellin transform of a single 
smoothing function - here, the Gaussian 


7.1 Results 

Write r]'y{t) = e~* Let us first give a bound for exponential sums on the primes 
using rj'y as the smooth weight. Without loss of generality, we may assume that our 
character x mod q is primitive, i.e., that it is not really a character to a smaller modulus 

q'\q- 

Theorem 7.1.1. Let x be a real number > 10®. Let xbe a primitive Dirichlet character 
mod q, 1 < q < r, where r = 300000. 

Then, for any d £ K with |i5| < 4r/g, 


OO 

^ A(n)x(n)e 

n—1 



e 


(n/x)2 

2 


= Iq=i ■ ri<y{-S) ■ X + E ■ X, 


where Iq=i = 1 if q = 1, Iq=i = 0 ifq f 1, and 


\E\ < 4.306- 10"^^ 



/650400 

V 


112 


We normalize the Fourier transform / as follows: f(t) = e{—xt)f{x)dx. Of 
course, rf${—S) is just ^ . 

As it turns out, smooth weights based on the Gaussian are often better in applica¬ 
tions than the Gaussian r]<y itself. Let us give a bound based on 77(f) = f^rj^if). 

Theorem 7.1.2. Let rjif) = Let x be a real number > 10®. Let x be a 

primitive character mod q, 1 < q < r, where r = 300000. 

Then, for any <5 £ M with |i5| < Ar/q. 


^ f 5 \ 

A{n)x{n)e I -n j ijin/x) = Iq=i ■ q{-6) ■ x + E ■ x, 

n—1 ^ 


where Iq—i = 1 if q = 1, Iq=i = 0 if q f 1, and 


\E\ < 2.485 • 10"^^ 



281200 



The advantage of 77(f) = f^rj'yif) over 7715 is that it vanishes at the origin (to second 
order); as we shall see, this makes it is easier to estimate exponential sums with the 
smoothing 77 5 . where is a Mellin convolution and g is nearly arbitrary. Here is 

a good example that is used, crucially, in Part [HI] 
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Corollary 7.1.3. Let ri{t) = *m ?72(0> where 772 = rji *m Vi rji — 

2 • /[i/2,i]- Let X be a real number > 10®. Let x be a primitive character mod q, 
1 < q < r, where r — 300000. 

Then, for any 5 S K with |i5| < 4r/g, 


00 / s \ 

A(n)x(n)e f -n j ^{n/x) = Iq=i ■ v{-S) ■ x + E ■ x, 

n—1 ^ 


where Iq=i = 1 if q = 1, Iq=i = 0 ifq 7^ 1, and 


\E\ < 2.485 • 10"^® 



381500 



Let us now look at a different kind of modification of the Gaussian smoothing. Say 
we would like a weight of a specific shape; for example, what we will need to do in 
Part [III] we would like an approximation to the function 


0 


for t e [0, 2], 
otherwise. 


(7.3) 


At the same time, what we have is an estimate for the Mellin transform of the Gaussian 
e“* centered at t = 0. 

The route taken here is to work with an approximation 77+ to rjo. We let 


77 +(f) = hnit) ■ te 


where hn is a band-limited approximation to 


h(f) 


f2(2-f)3e*-i/2 iffG[0,2], 

0 otherwise. 


(7.4) 


(7.5) 


By band-limited we mean that the restriction of the Mellin transform of hn to the 
imaginary axis is of compact support. (We could, alternatively, let hn be a function 
whose Fourier transform is of compact support; this would be technically easier in 
some ways, but it would also lead to using GRH verifications less efficiently.) 

To be precise: we define 


Pnit) 


sin(i7 logy) 
TTlogy 


hnit) = {h *M PH){y) 


f h{ty i)Ff7(y) — 

Jo y 


(7.6) 


and H is a positive constant. It is easy to check that MFfi{iT) = 1 for —H < 
T < H and MEniir) = 0 for t > H 01 t < —H (unsurprisingly, since Eh is a 
Dirichlet kernel under a change of variables). Since, in general, the Mellin transform of 
a multiplicative convolution / 9 equals Mf-Mg,we see that the Mellin transform 
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of hn, on the imaginary axis, equals the truncation of the Mellin transform of h to 
[—iH, iH], Thus, hn is a band-limited approximation to h, as we desired. 

The distinction between the odd and the even case in the statement that follows 
simply reflects the two different points up to which computations where carried out in 
IPlabI : these computations were, in turn, to some extent tailored to the needs of the 
present work (as was the shape of 77+ itself). 


Theorem 7.1.4. Let ri{t) = p+(f) = where hn is as in (7.6) and 

H = 200. Let X be a real number > 10^^. Let xbe a primitive character mod q, where 
1 < q < 150000 if q is odd, and I < q < 300000 if q is even. 

Then, for any (5 S K with |i5| < 600000 • gc(l{q, 2)/g, 


^ f 5 \ 

A{n)x{n)e I -n j qin/x) = Iq=i ■ fj{-5) ■ x + E ■ x, 

n—1 ^ ^ 


where Iq=i = 1 if q = 1, Iq=i = 0 ifq f 1, and 


\E\ < 1.3482 • lO"^'^ + 


If q = 1, we have the sharper bound 


1.617-10-1° 1 /499900 


Vx \ 


52 


\E\ < 4.772 • 10-11 + 


251400 


This is a paradigmatic example, in that, following the proof given in 19.4 


bound exponential sums with weights of the form hH{t)e~* where hn is a band- 
limited approximation to just about any continuous function of our choosing. 

Lastly, we will need an explicit estimate of the £2 norm corresponding to the sum 
in Thm. |7.1.4| for the trivial character. 

Proposition 7.1.5. Let r]{t) = 'qj^{f) = hH{f)te~* where hn is as in (7.6) and 
H = 200. Let x be a real number > IQi^. 

Then 

00 „ C50 

^^A{n){\ogn)rf{n/x)=x- / ri\{t) logxt dt + Ei ■ x logx 

n=i -^0 

= 0.640206a; log x — 0.021095a; + E 2 ■ x log x, 


where 


1^11 < 5 . 123 - 10 - 1 ° 


366.91 


/X 


\E2\ <2-10-° 


366.91 

sfx 


7.2 Main ideas 

An explicit formula gives an expression 

Sr^^^{6/x, x) = Iq=ir]{—5)x — Es{p)x^ -f small error, 


(7.7) 














7 . 2 . MAIN IDEAS 


141 


where Iq=i = 1 if g = 1 and Iq=i = 0 otherwise. Here p runs over the complex 
numbers p with L{p, x) = 0 and 0 < 3?(p) < 1 (“non-trivial zeros”). The function Fs 
is the Mellin transform of e{St)r]{t) (see j |2.4[ ). 

The questions are then; where are the non-trivial zeros p of L{s, x)? How fast does 
Fs{p) decay as Q{p) —)■ ± 00 ? 

Write a = 5ft(s), r = 9(s). The belief is, of course, that u = 1/2 for every non¬ 
trivial zero (Generalized Riemann Hypothesis), but this is far from proven. Most work 
to date has used zero-free regions of the form cr < 1 — 1 /C log g|T|, C a constant. This 
is a classical zero-free region, going back, qualitatively, to de la Vallee-Poussin (1899). 
The best values of C known are due to McCurley IIMcC84a|| and Kadiri IKadOSI . 

These regions seem too narrow to yield a proof of the three-primes theorem. What 
we will use instead is a finite verification of GRH “up to Tq”, i.e., a computation show¬ 
ing that, for every Dirichlet character of conductor q < tq (rg a constant, as above), 
every non-trivial zero p — a + ir with |t| < Tq satisfies 3fi(CT) = 1/2. Such verifica¬ 
tions go back to Riemann; modern computer-based methods are descended in part from 
a paper by Turing 0Tur53L (See the historical article IIBoo06bl .) In his thesis BPlal IL 
D. Platt gave a rigorous verification for rg = 10®, Tq = 10®/g. In coordination with 
the present work, he has extended this to 

• all odd g < 3 • 10®, with Tq = 10®/g, 

• all even g < 4 • 10®, with Tq = max(10®/g, 200 -f 7.5 • 10^/g). 

This was a major computational effort, involving, in particular, a fast implementation 
of interval arithmetic (used for the sake of rigor). 

What remains to discuss, then, is how to choose ry in such a way Fs{p) decreases 
fast enough as |r| increases, so that gives a good estimate. We cannot hope for 
Fs{p) to start decreasing consistently before |t| is at least as large as a constant times 
|(5|. Since i5 varies within (—crg/g, crg/g), this explains why Tq is taken inversely 
proportional to g in the above. As we will work with rg > 150000, we also see that we 
have little margin for maneuver; we want Fs{p) to be extremely small already for, say, 
|t| > 801(51. We also have a Scylla-and-Charybdis situation, courtesy of the uncertainty 
principle; roughly speaking, Fs{p) cannot decrease faster than exponentially on |t|/|(5| 
both for |(5| < 1 and for 6 large. 

The most delicate case is that of S large, since then |t|/|(5| is small. It turns out 
we can manage to get decay that is much faster than exponential for S large, while no 
slower than exponential for 6 small. This we will achieve by working with smoothing 
functions based on the (one-sided) Gaussian 

The Mellin transform of the twisted Gaussian e{6t)e~* is a parabolic cylinder 
function U{a, z) with 2 : purely imaginary. Since fully explicit estimates for U{a, z), 
z imaginary, have not been worked in the literature, we will have to derive them our¬ 
selves. 

Once we have fully explicit estimates for the Mellin transform of the twisted Gaus¬ 
sian, we are able to use essentially any smoothing function based on the Gaussian 
Vvit) = As we already saw, we can and will consider smoothing functions 

obtained by convolving the twisted Gaussian with another function and also functions 
obtained by multiplying the twisted Gaussian with another function. All we need to 
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do is use an explicit formula of the right kind - that is, a formula that does not as¬ 
sume too much about the smoothing function or the region of holomorphy of its Mellin 
transform, but still gives very good error terms, with simple expressions. 

All results here will be based on a single, general explicit formula (Lem. [9XT] l valid 
for all our purposes. The contribution of the zeros in the critical trip can be handled in 
a unihed way (Lemmas 9.1.3 and 9.1.4| i. All that has to be done for each smoothing 
function is to bound a simple integral (in (9.24i). We then apply a hnite verihcation of 
GRH and are done. 









Chapter 8 

The Mellin transform of the 
twisted Gaussian 


Our aim in this chapter is to give fully explicit, yet relatively simple bounds for the 
Mellin transform Fs{p) of e{St)ri^{t), where r]'y{t) = and S is arbitrary. The 

rapid decay that results will establish that the Gaussian r](y is a very good choice for a 
smoothing, particularly when the smoothing has to be twisted by an additive character 

e{5t). 

The Gaussian smoothing has been used before in number theory; see, notably, 
Heath-Brown’s well-known paper on the fourth power moment of the Riemann zeta 
function IIHB79I . What is new here is that we will derive fully explicit bounds on the 
Mellin transform of the twisted Gaussian. This means that the Gaussian smoothing will 
be a real option in explicit work on exponential sums in number theory and elsewhere 
from now onQ 


Theorem 8.0.1. Let fs{t) = e * ^^e{St), 5 S M. Let Fg be the Mellin transform of fg. 
Let s = a + IT, a > Q, T Q. Let £ = —2tt 5. Then, ifsgn{6) ^ sgn(r) and S f 0, 




Cl for p arbitrary, 

ci,a,TlF’ for p< 312. 


( 8 . 1 ) 


^ There has also been work using the Gaussian after a logarithmic change of variables; see, in particular, 
ILeh66l . In that case, the Mellin transform is simply a Gaussian (as in, e.g., IMV07I Ex. XII.2.9]). However, 
for S non-zero, the Mellin transform of a twist e((5t)e“ decays very slowly, and thus would not be 

useful for our purposes, or, in general, for most applications in which GRH is not assumed. 
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where p = At 

E{p) = ^ ( 


1 2{v{p)-iy 

^ , arccos - 
2 V vip) 



( 8 . 2 ) 


= M ' ii/vsr 


and 


v(p) = 

Ifsgn{6) = sgn(r) or S = 0, 

\Fs{s)\<\xor ■e-^^'\T{s)\e^\ 


'i + Vp^ + i 


(( 


1 + —"le-Tl-l 
^ 23/2.' 




where 


j 0.51729y^ for p arbitrary, 
- jo.84473|5[ forp< 3/2. 


(8.3) 


(8.4) 


As we shall see, the choice of smoothing function p{t) = e“* can be easily 
motivated by the method of stationary phase, but the problem is actually solved by the 
saddle-point method. One of the challenges here is to keep all expressions explicit and 
practical. 


(In particular, the more critical estimate, (8.1 1 , is optimal up to a constant depending 
on tr; the constants we give will be good rather than optimal.) 


The expressions in Thm. 8.0.1 can be easily simplified further, especially if one is 
ready to introduce some mild constraints and make some sacrifices in the main term. 

Corollary 8.0.2. Let fs{t) = e~*^^‘^e{dt), i5 € K. Let Fg be the Mellin transform of 
fg. Let s = a + ir, where a € [0, 1] and |t| > 20. Then, for 0 < k < 2, 

_,) + m <(g)‘ iMM/e < sa 

Kfe,l|T|'=/2e-0-1598|r| ifA\T\/f > 3/2. 


where 


^ 0,0 ^ 3.001, /^i,o ^ 4.903, ^ 2,0 ^ 7.96, 
Ko,i < 3.286, K 14 < 4.017, K 2 ,i < 5.13. 


We are considering Fg{s + k), and not just Fg{s), because bounding Fg{s + k) 
enables us to work with smoothing functions equal to or based on t^e~^ Clearly, 


we can easily derive bounds with k arbitrary from Thm. 8.0.1 It is just that we will 
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use k = 0,1, 2 in practice. Corollary 8.0.2 is meant to be applied to cases where r 
is larger than a constant ( 10 , say) times \l\, and a cannot be bounded away from 1 ; if 
either condition fails to hold, it is better to apply Theorem |8.0.1 [ directly. 

Let us end by a remark that may be relevant to applications outside number theory. 
By (8.91, Thm. 8.0.1 gives us bounds on the parabolic cylinder function (7(o, z) for z 
purely imaginary. (Surprisingly, there seem to have been no fully explicit bounds for 
this case in the literature.) The bounds are useful when |3(a)| is at least somewhat 
larger than |3(2:)| (i.e., when |t| is large compared to ^). While the Thm. 8.0.1 is stated 
for cr > 0 (i.e., for 3fi(a) > —1/2), extending the result to larger half-planes for a is 
not hard. 


8.1 How to choose a smoothing function? 


Let us motivate our choice of smoothing function rj. The method of stationary phase 
( 0Olv74l §4.11], OWonOll §11.3])) suggests that the main contribution to the integral 


poo 

Fs{t) = / e{6t)f]{t)C 

Jo 


dt 

T 


should come when the phase has derivative 0. The phase part of ( |8.5| l is 

e{St)t' 


j.Q{s)i _ g(2'7r(5£+r log t )2 


(8.5) 


(where we write s = cr + ir); clearly. 


{27x51 + r log t)' = 27x5 + ^ = 0 


when t = —T 127x5. This is meaningful when f > 0, i.e., sgn(T) ^ sgn((5). The 
contribution oft = —t/27x5 to (8.5 i is then 




(T+zr—1 


( 8 . 6 ) 


multiplied by a “width” approximately equal to a constant divided by 


y/\{27xi5t + T\ogty'\ = 


27r|(5| 

\/M 


The absolute value of (|8.61l is 


( \ 

— T 

V 27x5) 

27x5 


(8.7) 


In other words, if sgn(r) ^ sgn(^) and 5 is not too small, asking that Fs{a + ir) 
decay rapidly as |r| —>■ c» amounts to asking that 77 (f) decay rapidly as f —>■ 0. Thus, 
if we ask for Fs{a + ir) to decay rapidly as |t| —>■ 00 for all moderate <5, we are 
requesting that 
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1 . r]{t) decay rapidly as i —?► oo, 

2. the Mellin transform Fo{a + ir) decay rapidly as r —)■ ±oo. 

Requirement © is there because we also need to consider Fs{(7 + it) for 6 very small, 
and, in particular, for (5 = 0. 

There is clearly an uncertainty-principle issue here; one cannot do arbitrarily well 
in both aspects at the same time. Once we are conscious of this, the choice p(t) = 
in Hardy-Littlewood actually looks fairly good: obviously, 77 (f) = decays expo¬ 
nentially, and its Mellin transform r(s + ir) also decays exponentially as r — ±00. 
Moreover, for this choice of 77 , the Mellin transform Fs (s) can be written explicitly: 

i^ 5 (s) = r(s)/(l - 27r^(5)^ 

It is not hard to work out an explicit formultj^for 77 (f) = e“*. However, it is not 
hard to see that, for Fs{s) as above, ^^(1/2 -f it) decays like just as we 

expected from ( |8.7| i. This is a little too slow for our purposes: we will often have 
to work with relatively large 6, and we would like to have to check the zeroes of L 
functions only up to relatively low heights f - say, up to 50|^|. Then > 

= 0.00033 ..., which is not very small. We will settle for a different choice of 77 : 
the Gaussian. 

The decay of the Gaussian smoothing function 77 (f) = is much faster than 

exponential. Its Mellin transform is r(s/2), which decays exponentially as 5(s) —>^ 
±(X). Moreover, the Mellin transform Fs{s) (S ^ 0), while not an elementary or 
very commonly occurring function, equals (after a change of variables) a relatively 
well-studied special function, namely, a parabolic cylinder function U{a,z) (or, in 
Whittaker’s IIWhi03l notation, D_a-i/ 2 {z)). 

For 6 not too small, the main term will indeed work out to be proportional to 
g-(T/27r5) / 2 ^ jjjg method of stationary phase indicated. This is, of course, much 
better than The “cost” is that the Mellin transform r(s/2) for (5 = 0 now 

decays like rather than This we can certainly afford. 

8.2 The twisted Gaussian: overview and setup 

8.2.1 Relation to the existing literature 

We wish to approximate the Mellin transform 

F5(s) = f e~*^/‘^e{6t)F^, (8.8) 

Jo t 

where (5 S K. The parabolic cylinder function (7 : C is given by 

— z^/4 /-oo 

U{a,z) = -y. -^ / f“-5e-5* -^‘df 

_ r (i + a) Jo 

^There may be a minor gap in the literature in this respect. The explicit formula given in IHL22I Lemma 
4] does not make all constants explicit. The constants and trivial-zero terms were fully worked out for 
q = Iby |Wig2Q| (cited in IMVQ7I Exercise 12.1.1.8(c)]; the sign of hyp^ ^(z) there seems to be off). As 
was pointed out by Landau (see IHar66l p. 628]), IHL22I seems to neglect the effect of the zeros p with 
9 [^(p) = 0, ^(p) 7 ^ 0 for X non-primitive. (The author thanks R. C. Vaughan for this information and the 
references.) 
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for 3?(a) > —1/2; the function can be extended to all a,z G C either by analytic 
continuation or by other integral representations ( IIAS641 §19.5], OTemlOl §12.5(i)]). 
Hence 

Fs{s) = -27ri6^ . (8.9) 

The second argument of U is purely imaginary; it would be otherwise if a Gaussian of 
non-zero mean were chosen. 

Let us briefly discuss the state of knowledge up to date on Mellin transforms of 
“twisted” Gaussian smoothings, that is, multiplied by an additive character 

e{6t). As we have just seen, these Mellin transforms are precisely the parabolic cylin¬ 
der functions U{a, z). 

The function U{a,z) has been well-studied for a and z real; see, e.g., OTemlOL 
Less attention has been paid to the more general case of a and z complex. The most 
notable exception is by far the work of Olver 0Olv58l . Il01v59l . lOlvhll . 0Olv65l : he 
gave asymptotic series for U (a, z), a, z G C. These were asymptotic series in the sense 
of Poincare, and thus not in general convergent; they would solve our problem if and 
only if they came with error term bounds. Unfortunately, it would seem that all fully 
explicit error terms in the literature are either for a and z real, or for a and z outside 
our range of interest (see both Giver’s work and 0TVO3I .1 The bounds in 0Olv61l 
involve non-explicit constants. Thus, we will have to And expressions with explicit 
error bounds ourselves. Our case is that of a in the critical strip, z purely imaginary. 


8.2.2 General approach 


We will use the saddle-point method (see, e.g., lldB81l §5], Il01v741 §4.7], BWonOll 
§11.4]) to obtain bounds with an optimal leading-order term and small error terms. (We 
used the stationary-phase method solely as an exploratory tool.) 

What do we expect to obtain? Both the asymptotic expressions in I01v591 and the 
bounds in BOlvhlll make clear that, if the sign of r = 3(s) is different from that of 5, 
there will a change in behavior when r gets to be of size about (27r(5)^. This is unsur¬ 
prising, given our discussion using stationary phase: for |3(a)| smaller than a constant 
times |3(z)p, the term proportional to should be dominant, 

whereas for |3(a)| much larger than a constant times |3(z)p, the term proportional to 

__! f T \ 2 

g 2 V 2,r5; should be dominant. 

There is one important difference between the approach we will follow here and 
that in IIHelal . In BHelal . the integral ( |8.8| l was estimated by a direct application of 
the saddle-point method. Here, following a suggestion of N. Temme, we will use the 
identity 

i-c+ioo 

U{a,z) = ^^ ( 8 . 10 ) 


(see, e.g., HOLBClOl (12.5.6)]; c > 0 is arbitrary). Together, 
that 


and (8.10 1 give us 


Fsis) = 


,-27r-'5' 


r(s) 


c+ioo 




( 8 . 11 ) 
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Estimating the integral in ( 8.11| l turns out to be a somewhat cleaner task than estimating 
( 8 . 81 . The overall procedure, however, is in essence the same in both cases. 


We write 


(j){u) = —-— (2Tri6)u + ir log u 


for u real or complex, so that the integral in (8.11|l equals 


lis) = j 
J C 


c-\-ioo 




( 8 . 12 ) 


(8.13) 


We wish to hnd a saddle point. A saddle point is a point u at which (j)'{u) = 0. 
This means that 


— u — 2TTi6 H-=0, i.e., — iiu — ir = 0, 

u 

where I = — 27r<5. The solutions to (j)' {u) = 0 are thus 

a ± yJ—P + Air 


(8.14) 


(8.15) 


The value of (t>{u) at wq is 


4 >{ uq ) = _|_ ir logMo 

U Uq 

= iruo + IT iog—^. 

2 Ve 


(8.16) 


The second derivative at uq is 


'(uo) = - 2 (^0 + =- 2 + 2ir). 


(8.17) 


Assign the names uo,+, mq,- to the roots in (8.15 i according to the sign in front 
of the square-root (where the square-root is dehned so as to have its argument in the 
interval (—7r/2,7r/2]). We will actually have to pay attention just to mo,+, since, unlike 
Uo,_, it lies on the right half of the plane, where our contour of integration also lies. 
We remark that 


uo,+ = -^- 



(8.18) 


where the sign ± is -|- if .^ > 0 and — if £ < 0. If £ = 0, then uo,+ = (l/v^ -|- 
il^/2)yTT. 

We can assume without loss of generality that r > 0. We will hnd it convenient to 
assume t > 0, since we can deal with r = 0 simply by letting t ^ 0+. 
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8.3 The saddle point 

8.3.1 The coordinates of the saddle point 


We should start by determining uo,+ explicitly, both in rectangular and polar coordi¬ 


nates. For one thing, we will need to estimate the integrand in (8.131 for u = uo,+. The 
absolute value of the integrand is then | = and, by 

dO^ , 


= -2^(^o,+) - arg(uo,+)T. 


(8.19) 


If £ = 0, we already know that 3fi(Mo,-i-) = ^(^ 0 ,+) = \/'r/2, |mo,+ | = and 
arg uo,+ = 7 t/4 . Assume from now on that £ ^ 0. 


that 


We will use the expression for uo,+ in (8.18|l. Solving a quadratic equation, we see 


1 , 4t. , . /j(p) + l 


( 8 . 20 ) 


where j{p) = (1 + p'^Y^^ and p = 4 t jH?. Hence 




S(uo.-r) = 2 ( ^ ^ 


j{p) + 1 


( 8 . 21 ) 


Here and in what follows, the sign ± is + if £ > 0 and — if £ < 0. (Notice that 5ft(uo,+) 
and 3(rto,+) are always positive, except for r = £ = 0, in which case 5ft(Mo,+) = 
5(uo.+) = 0.) By ([8^, 


K+l = ^ 


-1 + j'(p) 


1 ± 


1 + j(p) 


1^1 / -l+j(p) . 1+J(P) , II 2'/ ^'^ 

2 V 2 2 V 2 

§^i + j{p)±2 ^^ VvipY ± y{p), 


( 8 . 22 ) 
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where v{p) = -\/(l + j(p))/2- We now compute the argument of mo,+: 
arg(uo,+ ) = ai-g U (i ± \/-l + ip] ] 




= arcsin 


= arcsin 


±1 


i+i(p) 

2 


\ 


^sjl+3{p)±2^^ 


( 


= arcsin 


±1 + 


i+j(p) \ 

2 


1+J(P) 


A 




-K 1 / , 2 

= — — - arccos \±\ - - 

2 2 I yi+j{p) 


(by cos(7r — 29) = — cos 29 = 2 sin^ 9 — 1). Thus 


(8.23) 


arg(uo,+ ) = 


T(p) = 5 arccos ^ if f > 0, 

1 arccos if f < 0. 

2 v(p) 


i arccos 

Z 


(8.24) 


In particular, arg(Mo,+) lies in [0,7r/2], and is close to 7r/2 only when £ > 0 and 
p —0+. Here and elsewhere, we follow the convention that arcsin and arctan have 
image in [—7r/2,7r/2], whereas arccos has image in [0, tt]. 


8.3.2 The direction of steepest descent 


As is customary in the saddle-point method, it is now time to determine the direction 
of steepest descent at the saddle-point mo,+. Even if we decide to use a contour that 
goes through the saddle-point in a direction that is not quite optimal, it will be useful 
to know what the direction lu of steepest descent actually is. A contour that passes 
through the saddle-point making an angle between — 7r/4 + e and 7r/4 — e with w 
may be acceptable, in that the contribution of the saddle point is then suboptimal by at 
most a bounded factor depending on e; an angle approaching —7r/4 or 7r/4 leads to a 
contribution suboptimal by an unbounded factor. 


Let w e C be the unit vector pointing in the direction of s teepe st descent. Then, 
by definition, {uo,+) is real and positive, where (j) is as in (8.12i. Thus aig{w) = 
— aig{4>” {uo,+))/2 mod tt. (The direction of steepest descent is defined only modulo 
TT.) By ([8^, 


arg((^"(uo,+)) = —TT -f arg(iftto,+ + 2ir) — 2arg(uo,+) mod27r 

TT 

= —— + arg(fMo,+ -I- 2t) — 2arg(ito,+) mod27r. 
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By (8.21 1 , 


3fJ(£uo.+ + 2 t) = - ± 


+ 2t) = — ( 1 ± 


jjp) - 1 , ^ f , 

2 £2 I 2 I ^ 


jif) - 1 


Up) +1 


Therefore, arg(ftto,+ + 2r) = arctanw, where 


1± 


j(p)+i 




It is easy to check that sgn vu = sgn i. Hence, 


arctan vj = ± -arctan 

2 


P± 


j(p)-i 

2 


1 ± 


i(p)+i 

2 


At the same time, 
P^ v 2 


(p±V¥)(it 


.7 + 1 ^ 

2 J p± v'2(j-l)TPv/2(7Tl) 


1±a/^ 


1_ I±i 
^ 2 


1 - i 


+ p±^{p-p-ij + i)) 

1 - i 1 - j 

/0(1Tj7'u) (-1 ± j7ti)0'+ 1) 2z;(-r;±j) 


1 - j 


P 


(8.25) 


Hence, modulo 27r, 


/ ,11 r w 2v(—V ±j) 

arg(0 (uo,+)) = — arctan--• — 2arg(uo,+) — 


Therefore, the direction of steepest descent is 


0 iff>0 
TT if f < 0. 


„g(„) = _ ^"M1 , +1 afciAi) + » ‘f«i» 

J 2 2 p 1 f if£<0. 

(8.26) 


By (8.24 1 and arccos Ijv = arctan ^/v'^ — 1 = arctan y/ {j — 1)/2, we conclude that 
I + i arctan 2’^(t+’^) _|_ arctan if -f < 0, 

1 + 5 ^arctan 2'»(t~^) _ arctan if -^ > 0. 


arg(w) = 


(8.27) 
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Figure 8.1: arg(r(;) — tt/2 as a function of Figure 8.2: arg(?ii) — tt/2 as a function of 
p for £ < 0 p for £ > 0 


There is nothing wrong in using plots here to get an idea of the behavior of arg(u>), 
since, at any rate, the direction of steepest descent will play only an advisory role in 
our choices. See Figures [8T| and [8^ 


8.4 The integral over the contour 


We must now choose the contour of integration. The optimal contour should be one on 


which the phase of the integrand in (8.13 1 is constant, i.e., S5((/)(u)) is constant. This 


is so because, throughout the contour, we want to keep descending from the saddle 
as rapidly as possible, and so we want to maximize the absolute value of the deriva¬ 
tive of the real part of the exponent —(j){u). At any point u, if we are to maximize 
\‘3i{d(j){u)/dt)\, we want our contour to be such that = 0. (We can also 

see this as follows: if 3(0(u)) is constant, there is no cancellation in (8.131 for us to 
miss.) 

Writing u = x + iy, we obtain from (8.12 1 that 


3(0(u)) = —xy + £x + T log -\/x^ + 


(8.28) 


We would thus be considering the curve '^{(j){u)) = c, where c is a constant. Since we 
need the contour to pass through the saddle point mo,+, we set c = S((^(mo,-i-))- The 
only problem is that the curve '^{(j){u)) = 0 given by (8.28 1 is rather uncomfortable to 
work with. 

Instead, we shall use several rather simple contours, each appropriate for different 
values of £ and r. 


8.4.1 A simple contour 

Assume first that £ > 0. We could just let our contour L be the vertical line going 
through Since the direction of steepest descent is never far from vertical (see 
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B), this would be a good choice. However, the vertical line has the defect of going 
too close to the origin when p —^ 0 . 

Instead, we will let L consist of three segments: (a) the straight vertical ray 

{(a:o,y) : V > Vo}, 


where xq = 3?uo,+ > 0 . ?/o = Q'mo,+ > 0 ; (b) the straight segment going downwards 
and to the right from uo,+ to the a:-axis, forming an angle of 7 r /2 — f3 (where /3 > 0 
will be determined later) with the cc-axis at a point (a:i, 0 ); (c) the straight vertical ray 
{(cci, y) : 2 / < 0}. Let us call these three segments Li, L 2 , L 3 . Shifting the contour in 
( |8.13| l, we obtain 

1 = J 

and so |/| < h + I 2 + I 3 , where 



\du\. 


(8.29) 


As we shall see, we have chosen the segments Lj so that each of the three integrals Ij 
will be easy to bound. 

Let us start with Ii. Since u > 0, 


where, by ( | 8 . 12 | i. 


poo 

h < |«o,+ r" / 

J Vl\ 


2 2 

4-iy) = --- iy -T arg{x + iy). 


(8.30) 


Let us expand the expression on the right of ( |8.30| l for x = xq and y around yo = 
Suo,+ > 0. The constant term is 


£ f ^ T — 1 

3fi(/)(uo.+) =-- 2/0 - Targ(uo.+) =- —(1 + t;(p)) - - arccos 


l + r;(p) 1 —1 

- 1 — arccos —— 

P 2 v[p) 


(8.31) 


T, 


where we are using (8.19 1 , (8.21 1 and ( 8.24| l. 

The linear term vanishes because mo,+ is a saddle-point (and thus a local extremum 
on L). It remains to estimate the quadratic term. Now, in (8.30 1 , the term arg(x + iy) 
equals arctan(?//a:), whose quadratic term we should now examine - but, instead, we 
are about to see that we can bound it trivially. In general, for fp, f G R and / G 


f{t) = fito) + f{to) ■ (t-to) + [ [ f"{s)dsdr. (8.32) 

Jto Jto 

Now, arctan"(s) = —2s/ (s^ + 1)^, and this is negative for s > 0 and obeys 


arctan"(—s) = — arctan"(s) 
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for all s. Hence, for fo > 0 t > —Iq, 

arctant < arctanfo + (arctan'fo) • {t — to)- 


(8.33) 


Therefore, in (8.30 1 , we can consider only the quadratic term coming from — a;^)/2 
- namely, (y — yoP /2 - and ignore the quadratic term coming from arg(x + iy). Thus, 

(8.34) 

for y > —yo, and, in particular, for y > yp. Hence, 

pca poo _ 

/ (8.35) 


3?(/)(xo + iy) > + ^(j){uo,+) 


Notice that, once we choose to use the approximation ( 8.33| l, the vertical direction is 
actually optimal. (In turn, the fact that the direction of steepest descent is close to 
vertical shows us that we are not losing much by using the approximation (8.33 1 .) 

As for |uo,+ |“‘^, we will estimate it by the easy bound 

|mo,+ | = =max(A/r,£), (8.36) 

where we use ( |8.22| ). 

Let us now bound l 2 - As we already said, the linear term at uo,+ vanishes. Let 
Uo be the point at which L 2 meets the line normal to it through the origin. We must 
take care that the angle formed by the origin, uo,+ and Uo be no larger than the angle 
formed by the origin, (xi, 0) and ug; this will ensure that we are in the range in which 
the approximation ( |8.33[ ) is valid (namely, t > —tg, where tg = tanag). The first 
angle is 7r/2 + /? — arg ug,+, whereas the second angle is 7r/2 — /3. Hence, it is enough 
to set /3 < (argMg _|_)/2. Then we obtain from (8.12i and (8.33 1 that 


iR</)(u) > 3?(/>(ug_ + ) — 3? 


(u - Ug, + )^ 


(8.37) 


If we let s = |it — Mg +1, we see that 


3fJ 

Hence, 

I2 < \Uo\~'^ 

< lUoT'" 


(m - Mg, + )2 


= -cos 




— — /3)) = — — cos 2/3. 


o-SR </>(«) 


Idul 


g-SR0(uo, + )-V _ I |-CTg-5R</.(iio, + ) 


2 cos 2/3' 


(8.38) 


Since arg Mg = arg Mg_+ — /3, we see that, by (8.21 1 , 
|mo| = 3?((xg + iyg) (cos/3 - isin/3)) 



cos /3 + 1 + 


j + 1 


sin/3 


(8.39) 
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The square of the expression within the outer parentheses is at least 

j - 1 


cos"^ /3 + ( 1 + ^ + \/2{j + 1) ) sin^ (3 + 


I ./j-1 


sin 2/? 


> ^ + - sin^ /3 — - cos^ /? + - sin^ /3. 
— 22 ^2 ^ 2 ^ 


If /3 > 7r/8, then tan /3 > 1/V^, and so, since j > p, we obtain 


^ J 




lWo| > ^J|(l+sin2/3) > ^y'l+sin^/S. 


We can also apply the trivial bound j > 1 directly to (8.39 1 . Thus, 


|uo| > max ( \ j -\Jl + sin^/?, .f sin/3J . 

Let us choose f3 as follows. We could always set /? = tt/8: since arguo,+ > 7r/4, we 
then have f3 < (argMo,+)/2, as required. However, if p < 3/2, then v{p) < 1.18381, 
and so, by (8.24 1 , arg uo,+ > 1.28842. We can thus set either /3 = 7r/6 = 0.523598 ... 
or /? = 7r/5 = 0.628318 ..., say, either of which is smaller than (argMo,+ )/2. Going 
back to (|8.38[), we conclude that 


/2 < + ) . 


21/4 


1 + sin 


/ ii/2 

TT 

£sin — 

— cr 

, ^/tt 

e 

J cos27r/5 

5 


2 


for p arbitrary, and 


when v{p) < 3/2. 

It remains to estimate I^. For u = xi, 

(w ~ = 1 (1 - t^^D) vi 


> (1 - ■ / (^1 + ^ h - - 


> - (1 - tan^ /3) T, 

where we are using ( 8.21| l. Thus, ( 8.37[ ) tells us that 

1 i”?! T1 ^ Fi 

> 3fi</(uo,+) +-^ — -T. 


At the same time, by (8.30 1 and r, £ > 0, 


+ iy) > 3?(/)(a;i) + — 
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for 2 / < 0. Hence 

J J — oo 


< |a;i| 


!Lg- i ^ r ^-‘!R4>{uo, + ) ^ 


Here note that xi > (tan^)|Mo,+ |, and so, by ( 8.36| l, 

x\ > tan P ■ max (v^, • 

We conclude that, for £ > 0, 


|/| < 1 + 2 * 




1 + sin"^ f 


-■iRMua. + ) 


(t 


an 


rf7/2 


(since (1 — tan^ 7r/8)/4 = (v^ — l)/2) and, when p < 3/2, 


|/| < 1 + min 2-^+^, + 


secf 


e 6 


VW2 


(sinf)"! (1/73)-) 


5R<2>(«o,+) 


We know 3fJ(/(Mo,+) from (8.31 1 . Write 


so that 


p/ N 1 1 '^’{P) - 1 

E{.p) = ^ arccos -, 

2 v[p) p 


, 1 + t;(p) 1 —1 TT 2 

-5i/)(Mo,+) = - - + X arccos —- = - - E{p) + 

P 2 v[p) 2 p 


(8.40) 


To finish, we just need to apply (8.111. It makes sense to group together r(s)e 2 ^, 
since it is bounded on the critical line (by the classical formula |r(l/2 + ir)| = 
^/t:/ cosh TFT , as in OMV07I Exer. C.l(b)]), and, in general, of slow growth on bounded 
strips. Using (8.11 1 , and noting that 27r^i5^ = 7/2 = (2/p) • t, we obtain 


i«(*)i < . A ';:'"'■ A' 

\c 2 ,a,T/^ for p< 3/2. 


(8.41) 


where 


Cl,cr,T — 2 I 1+2"' 


A^y 


1 + sin^ I 




(tan I)' 


e 6 


(sinf)") [l/AY 


(8.42) 
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We have assumed throughout that £ > 0 and r > 0. We can immediately obtain a 
bound valid for € < 0, r < 0, by reflection on the a;-axis; we simply put absolute 
values around r and £ in ( |8.41| l. 

We see that we have obtained a bound in a neat, closed form without too much 
effort. Of course, this effortlessness is usually in part illusory; the contour we have 
used here is actually the product of some trial and error, in that some other contours 
give results that are comparable in quality but harder to simplify. We will have to 
choose a different contour when sgn(f) ^ sgn(r). 

8.4.2 Another simple contour 

We now wish to give a bound for the case of sgn(£) ^ sgn(r), i.e., sgn((5) = sgn(T). 
We expect a much smaller upper bound than for sgn(f) = sgn(T), given what we 
already know from the method of stationary phase. This also means that we will not 
need to be as careful in order to get a bound that is good enough for all practical 
purposes. 

Our contour L will consist of three segments: (a) the straight vertical ray {(* 0 , y) : 
y > 0}, (b) the quarter-circle from (xq, 0) to (0, —xq) (that is, an arc where the argu¬ 
ment runs from 0 to —TrjT), and (c) the straight vertical ray {{0,y) : y < —a^o}- We 
call these segments Li, L 2 , T 3 , and define the integrals Ii, I 2 and I 3 just as in ( |8.29 1 . 
Much as before, we have 


nOO 

Jo 

Since ( |8.33| > is valid for t > 0, ( |8.34| ) holds, and so 

/ oo 

e-h(v-vo?dy = 

-OO 


By (8.121 and (8.301, 


4 < a:o 


e-^'PiOdu = 


^7r/2 


- -\-ixQ sin a+ra 


) da. (8.43) 


Now, for a > 0 and f < 0, 

{£xq sin a + ra)' = £xo cos a + t > £xq -f r. 


Since j = + < 1 + l‘^, we have < p/2, and so, by (8.211, 

\£xo\ < -f^p/d = T, and thus fxo -f r > 0. In other words, the exponent in (8.43 i 
equals (xq cos 2a) 12 minus an increasing function, and so, since 3?^(xo) = — Xo/2, 

/*7r/2 cos 2a 

h < ■ Xo J da = xj/'^ • -Xq • /o(a;o/2), 


where Io{t) = ^ is the modified Bessel function of the first kind (and 

order 0). 
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Since cos9 = y/l — sin^ 9 < 1 — (sin^ 9)/2 <1 — 29'^ jn'^, we hav^ 


r e 

TT Jo 




d9 < e* 


1 


d9 = e‘ 


/o 

for f > 0 . 

Using the fact that 3?^(a:o) = —x\l2, we conclude that 




ra^o • 


23/2 4 


By (8.34 1 , which is valid for all we know that 3fi0(a:o) > ^(j){uo^+). 
Let us now estimate the integral on L 3 . Again by ( |8.30| l, for y < 0, 

,2 


= y - -^2/+ 'r|- 


Hence 




/L 3 


< a^o 


f -(4-^v+^i] 


du 


— a ^ 

= Xn 62 6 


2 2(.y U (ly — 2 ;^ '^g Y J — 


2 ’ 


since y—.f < —£ for y < —Xq and e“* ^^dt < yJ-K j2-e~^ /2 (by 0AS64I 7.1.13]). 

Now that we have bounded the integrals over Li, and L 3 , it remains to bound 
Xq from below, starting from ( 8.21| . We will bound it differently for p < 3/2 and for 
p > 3/2. (The choice of 3/2 is fairly arbitrary.) 

Expanding {yjl +1 — 1)2 > 0, we obtain that 2(1 + f) — 2yjl + t > t for all 
f > — 1 , and so 


/ym-iV ^ f t 


-(yiTf-i)^ < 0 , 


t J V2a/1 +1 
i.e., {y/T+1 — l)/t decreases as t increases. Hence, for p < po, where po > 0, 


j(p) = vT+7 > 1 + 

Po 

which equals 1 + (2/9)(-\/r3 — 2)p2 for po = 3/2. Thus, for p < 3/2, 


(8.44) 


^m.n(xa-2}p‘ v/vT3-2,„ 

“=»-TV -i'—6—I'l'’ 


(8.45) 


is actually not hard to prove rigorously the better bound Iq (t) < 0.468823e^/V^. For t > S, this can 
be done directly by the change of variables cos0 = 1 — 2s^, dO = Idsj\/\ — s^, followed by the usage 
of different upper bounds on the the integrand exp(—2fs^/\/l ~ -5^) for 0 < 5 < l/2and 1/2 < 5 < 1. 
(Thanks are due G. Kuperberg for this argument.) For f < 8, use the Taylor expansion of /o(f) around 
f = 0 IAS64I (9.6.12)]: truncate it after 16 terms, and then bound the maximum of the truncated series by 
the bisection method, implemented via interval arithmetic (as described in j2.6| . 
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On the other hand. 




= ^(/(p)p-0-(p)-i)) = 


- (1 + p^) + \/l + 

+p2 


> 0 , 


and so, for p > 3/2, (/(p) — l)/p is minimal at p = 3/2, where it takes the value 
(yi3- 2)/3. Hence 


X0 = y 


,(,) - 1 ^ ^ > 0.51729,/r. (8.46) 


V6 


76 


We now sum /i, I 2 and I 3 , and then use ( |8.11| i; we obtain that, when f < 0 and 
r > 0, 


1^^771 < 




|r(s)l 


7^ 


a-<t>Mu-'^du 


< 


7ol'" ((1 + e-7V(7|. 


(8.47) 


By ( |09l l, dOTl l and (|8^, 

T It 1 tt 

-5R(^(mo.+)) = ^(1 - ^(7) + 2 < 2 - 4”^' 

We conclude that, when sgn(f) ^ sgn(T) (i.e., sgn((5) = sgn(T)), 

\Fs{s)\ < |:ro|-'^ • e-7V(7|e5' ((l + ^) , 


where xq can be bounded as in (8.45 1 and (8.46 1 . Here, as before, we reducing the case 
T < 0 to the case r > 0 by reflection. This concludes the proof of Theorem|8.0.1| 


8.5 Conclusions 


We have obtained bounds on |F 5 (s)| for sgn((5) ^ sgn(r) (8.41 1 and for sgn((5) = 
sgn(r) (|8.47|. Our task is now to simplify them. 


First, let us look at the exponent E{p), dehned as in (8.2i. Its plot can be seen in 
Figure [83] We claim that 


E{p)> 


0.1598 ifp>1.5, 

0.1065p ifp<1.5. 


(8.48) 


This is so for p > 1.5 because E{p) is increasing on p and iJ(1.5) = 0.15982 .... The 
case p < 1.5 is a little more delicate. We can easily see that arccos(l — 7/2) > t for 
0 < f > 2 (since the derivative of the left side is 1/ \J\ — f^/4, which is always > 1). 
We also have 

2 4 2 
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for 0 < p < Vs, and so 


8 128 - - 8 

for 0 < p < \/32/5; this, in turn, gives us that l/t>(p) < 1 — p^/8 + 7/5^/128 (again 
for 0 < p < v^32/5), and so l/v{p) < 1 - (1 - 1 lM)p^ jS for 0 < p < 1/2. We 
conclude that 

1 1 

arccos 


therefore. 


;(p) - 2 V 64' 


E{p) >y^P-^> 0.11093P > 0.1065p. 


In the remaining range 1/2 < p < 3/2, we prove that E[p)/p > 0.106551 using 
the bisection method (with 20 iterations) implemented by means of interval arithmetic. 
This concludes the proof of ( |8.48 1 . 

Assume from this point onwards that |t| > 20. Let us show that the contribution 
of (|8.3[) is negligible relative to that of (|8.1|l. Indeed, 


(( 


1 + ^ ^ I < _e-o-i598^ 

^+23/2^® ^2 j - 106 


It is useful to note that e = e and so, for ct < fc + 1 and p < 3/2, 


= -2t/p 


=-40/p 


< 


1 

< — 


(0.84473|T|/f)'^ - {M^pY £cr - 1^0.84473-1.5/ 


g-so/Dt) 


1 f>-S0/(3t) 

< — ■ 3.15683'=+^- 


(8.49) 


tk+i ’ 
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where t = 2p/3 < 1. Since e its maximum at t = c/{k + 1), 


< g-(fe+i) ^ 3(fc + l) 


i-k+i 


\ 80 


k+l 


and so, for p < 3/2, 


r 0.04355 if0<cr<l, 
|a;or‘"e-5^" < ^ • < 0.00759 if 1 < ct < 2, 
[ 0.00224 if2<cr<3. 


whereas \xq \-^< \xq\-'^ < (0.51729Vr)-'^ for p > 3/2. 

We conclude that, for |t| > 20 and cr < 3, 

\Fs{s)\ < |r(s)|et^ ■ | J (8-50) 

lip >3/2 

provided that sgn((5) = sgn(r) or <5 = 0. This will indeed be negligible compared to 
our bound for the case sgn(i5) = — sgn(T). 

Let us now deal with the factor |r(s)|e5'^. By Stirling’s formula with remainder 
term IIGR94I (8.344)], 

logr(s) = ^log(27r) + - 0 logs - s + +^ 2 ( 5 ), 


where 


|i?2(s)| < 


1/30 


12|s|3c0s3(5^) 
for 3?(s) > 0. The real part of (s — 1/2) log s — s is 


V2 

180|s|3 


(cr - l/2)log|s| - rarg(s) - a = {a - l/2)log|s| - + r ( arctan ^ ^ ) 

2 V rl h\J 

for s = a + IT, a > 0. Since arctan(r) < r for r > 0, we conclude that 

|r(s)|e5^ < V^|sr"5eT^ + i^^. (8.51) 

Lastly, |s|‘"-i/2 = + icr/rl'^-i/^. Por |t| > 20, 


r 1.000625 
/iLicj/r/’-^l'^ < \ 1.007491 
[ 1.028204 


if 0 < cr < 1, 
if 1 < cr < 2, 
if 2 < cr < 3 


eT5tvT + l5^ < 1.004177. 


and 
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Thus, 


2.51868 

if 0 < cr < 1 , 

2.53596 

if 1 < cr < 2 , 

2.5881 

if 2 < cr < 3. 

and C 2 ,cr 

^ in (8.2 1 . By |r| > 20, 


(8.52) 


e ('^2 < 0.015889, < 0.035674. 

Since 8 sin(7r/8) = 3.061467... > 1, we obtain that 

{ 1.30454 if 0 < cr < 1, r 1.94511 if 0 < cr < 1, 

1.58361 if 1 < cr < 2, C 2 ,a,r < < 3.15692 if 1 < cr < 2, 
1.98186 if 2 < cr < 3, [ 5.02186 if 2 < cr < 3. 

Lastly, note that, for fc < cr < fc + 1, we have 

|.^|<r-l/2 ^ < ^fe/2^ 


(8.53) 


1 


7-'r'/2 

whereas, for p < 3/2 and 0 < 7 < 1, 




Hi)' 


< |t| 2 M ) < 2O2 2 




7/2 


and so 


t-1/2 _ 




|t-|{'^}-1/2 


{<r} 


< 


< 




1/2 


Multiplying, and remembering to add (|8.501l, we obtain that, for fc = 0,1, 2, cr S 
[ 0 , 1 ] and |r| > 20 , 


|T(5(s + A:)| + |L)5((1 


s) + k)\ < 





ifp < 3/2, 
ifp > 3/2, 


where 

Ko,o < (4 • 10"^ + 1.94511) • 2.51868 • yHl/S < 3.001, 

Ki.o < (4 • 10"^ + 3.15692) • 2.53596 • y/s/S < 4.903, 

K 2.0 < (4 • 10"^ + 5.02186) • 2.5881 • vWs < 7.96, 


and, similarly. 


Ko.i < (6 • 10"^ + 1.30454) • 2.51868 < 3.286, 
Ki.i < (6 • 10"® + 1.58361) • 2.53596 < 4.017, 
K 2 ,i < (6 • 10"® + 1.98186) • 2.5881 < 5.13. 


This concludes the proof of Corollary |8.0.2| 










Chapter 9 


Explicit formulas 


An explicit formula is an expression restating a sum such as 5^ x) as a sum of 

the Mellin transform Gs{s) over the zeros of the L function L(s, x)- More specifically, 
for us, Gs{s) is the Mellin transform of r]{t)e{6f) for some smoothing function p and 
some (5 S K. We want a formula whose error terms are good both for 6 very close or 
equal to 0 and for S farther away from 0. (Indeed, our choice(s) of ry will be made so 
that Fs{s) decays rapidly in both cases.) 


We will be able to base all of our work on a single, general explicit formula, namely. 
Lemma [9. 1.1 1 This explicit formula has simple error terms given purely in terms of a 
few norms of the given smoothing function rj. We also give a common framework for 
estimating the contribution of zeros on the critical strip (Lemmas 9.1.3|and|9.1.4i. 


The first example we work out is that of the Gaussian smoothing 77 (f) = 

We actually do this in part for didactic purposes and in part because of its likely ap¬ 
plicability elsewhere; for our applications, we will always use smoothing functions 
based on te~* and generally in combination with something else. Since 

77 (f) = does not vanish at f = 0, its Mellin transform has a pole at s = 0 

- something that requires some additional work (Lemma |9.L2t see also the proof of 
Lemma[9.Ll|l. 


Other than that, for each function 77 (f), all that has to be done is to bound an integral 
(from Lemma 9.L3| l and bound a few norms. Still, both for ry* and for 7 y+, we find a 
few interesting complications. Since ry_|_ is defined in terms of a truncation of a Mellin 
transform (or, alternatively, in terms of a multiplicative convolution with a Dirichlet 
kernel, as in (7.4 1 and (7.6 1 ), bounding the norms of 7 y+ and 77 )^ takes a little work. We 
leave this to Appendix ApThe effect of the convolution is then just to delay the decay 
a shift, in that a rapidly decaying function /(r) will get replaced by /(t — iJ), H a 
constant. 


The smoothing function 77 * is defined as a multiplicative convolution of f^e * 
with something else. Given that we have an explicit formula for f^e“* we obtain an 
explicit formula for 77 * by what amounts to just exchanging the order of a sum and an 
integral. (We already went over this in the introduction, in (|1.40|l.) 
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CHAPTER 9. EXPLICIT FORMULAS 


9.1 A general explicit formula 


We will prove an explicit formula valid whenever the smoothing rj and its derivative p' 
satisfy rather mild assumptions - they will be assumed to be L 2 -integrable and to have 
strips of definition containing {s : 1/2 < 5ft(s) < 3/2}, though any strip of the form 
{s : e < 3?(s) < 1 + ej would do just as well. 

(For explicit formulas with different sets of assumptions, see, e.g., lUKiMl §5.5] and 
OMV07I Ch. 12].) 

The main idea in deriving any explicit formula is to start with an expression giving 
a sum as integral over a vertical line with an integrand involving a Mellin transform 
(here, Gs{s)) and an L-function (here, L{s, x))- We then shift the line of integration to 
the left. If stronger assumptions were made (as in Exercise 5 in OIK041 §5.5]), we could 
shift the integral all the way to 5ft(s) = —c»; the integral would then disappear, replaced 
entirely by a sum over zeros (or even, as in the same Exercise 5, by a particularly simple 
integral). Another possibility is to shift the line only to 5ft(s) = 1/2 + e for some e > 0 
- but this gives a weaker result, and at any rate the factor L'{s, x) /L{s, x) can be large 
and messy to estimate within the critical strip 0 < 3?(s) < 1. 

Instead, we will shift the line to 3?s = —1/2. We can do this because the assump¬ 
tions on rj and p' are enough to continue G' 5 (s) analytically up to there (with a possible 
pole at s = 0). The factor L'{s, x)/L{s, x) is easy to estimate for Jfts < 0 and s = 0 
(by the functional equation), and the part of the integral on = —1/2 coming from 
Gs{s) can be estimated easily using the fact that the Mellin transform is an isometry. 

Lemma 9.1.1. Let rj : K]} —>■ K fee in C^. Let x £ M“'", 5 S M. Let x be a primitive 
character mod q, q > 1. 

Write Gsis) for the Mellin transform of r](t)e{St). Assume that r](t) and rjft) are 
in (with respect to the measure dt) and that r]{f)G~^ and rj'{f)G~^ are in l\ (again 
with respect to dt) for all a in an open interval containing [1/2, 3/2]. 

Then 


where 


^ f S \ 

^A(n)x(n)e f -n j rj{nlx) = Iq=i ■ p(-5)x - ^ Gs{p)3 

n—1 ^ ^ p 

- R + O* {{\ogq + 6M) ■ (|p'| 2 + 27r|(5||?7|2))a;"^/^, 

1 ifq = 1, 


/,= ! = 


0 ifq f I, 


R = 77(0) ( log ^ + 7 - ) + O*(co) 


for q > 1, R = 77 ( 0 ) log 2^ for q = I and 


CO = 3O* 


Tj'if) 


Vt 


■q'(f)s/i + 27 r|( 5 | 




Vi 


+ Wt)Vti 


(9.1) 


(9.2) 


(9.3) 


The norms \q\ 2 , IW W (t)/ Vi\i, He., are taken with respect to the usual measure dt. 
The sum is a sum over all non-trivial zeros p of L(s, x)- 
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Proof. Since (a) is in £i for cr in an open interval containing 3/2 and (b) 

rj{t)e{5f) has bounded variation (since 77 , 77 ' S fi, implying that the derivative of 
rj{t)e{5f) is also in fi), the Mellin inversion formula (as in, e.g., OIK04I 4.106]) holds: 

1 

ri{n/x)e{6n/x) = -— / Gs{s)x‘^n~'^ds. 

2m 

Since Gs{s) is bounded for 5ft(s) = 3/2 (by € if) and 'Ylm is 

bounded as well, we can change the order of summation and integration as follows: 


^ ^ -j ^^-1-200 

V A{n)xin)e{6n/x)r]{n/x) = V A{n)x{n) ■ — / Gsis)x''n~‘'ds 

y3__ 

1 /*§+200 C )0 

= — / ^ A(n)x(n)G5(s)a;''7r“®ds (9.4) 

27r7 


> n—1 

^ 3 




27r7 


L'{s,x) 

Hs,x) 


Gs{s)x^ds. 


(This is the way the procedure always starts: see, for instance, 0HL22I Lemma 1] or, 
to look at a recent standard reference, IIMV07I p. 144]. We are being very scrupulous 
about integration because we are working with general 77 .) 

The first question we should ask ourselves is: up to where can we extend Gs{s)l 
Since is in £i for cr in an open interval I containing [1/2, 3/2], the transform 

Gs{s) is defined for 3?(s) in the same interval /. However, we also know that the 
transformation rule M{tf'{t)){s) = —s ■ Mf{s) (see (2.10i; by integration by parts) 
is valid when s is in the holomorphy strip for both M{tf'{f)) and Mf. In our case 
(/(f) = f}{f)e{5t)), this happens when 5ft(s) € (/ — 1 ) n / (so that both sides of the 
equation in the rule are defined). Hence s ■ Gs{s) (which equals s ■ Mf{s)) can be 
analytically continued to 3fi(s) in (/ — 1) U I, which is an open interval containing 
[—1/2,3/2]. This implies immediately that Gs{s) can be analytically continued to the 
same region, with a possible pole at s = 0 . 

When does Gs{s) have a pole at s = 0? This happens when sGs{s) is non-zero at 
s = 0, i.e., when M(f/'(f))(0) 0 for f(t) = r]{t)e{St). Now 


poo 

M(f/'(t))(0) = / f{t)dt = lim f{t) - /(O). 

Jo 

We already know that /'(f) = {d/dt){r]{t)e{6t)) is in £ 1 . Hence, limt_,.oo /(f) exists, 
and must be 0 because / is in £ 1 . Hence —M(f/'(f))(0) = /(O) = ? 7 ( 0 ). 

Let us look at the next term in the Laurent expansion of Gs{s) at s = 0. It is 


lim 

s—>-0 


sGsis)-rjiO) -M(f/'(f))(s)-/( 0 ) ,. 1 

--= lim ■ ■ ' = — lim - / 

S s —>^0 S s—^0 S Ir\ 




ps _ -| roo 

f'(f) lim- dt = — /'(f) logf dt. 

s^o s ./n 


l)dt 
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Here we were able to exchange the limit and the integral because f'{t)T^ is in 
for tr in a neighborhood of 0; in turn, this is true because f'{t) = r]'{t) + 2TriSrj{t) 
and r]'{t)U and ri(t)U are both in £i for cr in a neighborhood of 0. In fact, we will 
use the easy bounds |? 7 (f)logf| < (2/3)(|77(f)f“^/^|i + |? 7 '(f)logf| < 

(2/3)(|77'(f)f“^/^|i + |77'(f)f^/^|i), resulting from the inequality 


(t 2 +^2^ < |logi|, 


valid for all f > 0. 

We conclude that the Laurent expansion of Gs{s) at s = 0 is 

„ / X ^(0) 

Gs{s) — - h Co + CiS + . . . , 


(9.5) 


(9.6) 


where 


Co = 0*{\f{t) logf|i) 

v'it) 


2 

= O* 
3 


Vi 


l'{t)Vi 


+ 2tt5 


yjt) 

Vi 


Wt)Vi\i 


We shift the line of integration in (9.4 1 to 3?(s) = —1/2. We obtain 


1 p2+ioo Til \ 

/ —r^Gs{s)x^ds = Ig=iGsil)x - V Gs{p)xP - R 
27 rz J2-*oo L{s, x) ^ 


1 

2TTi 


f’—l/2+ioo Ti 


where 


Of course. 


R = Res, 


5=0 


—1/2—ioo 

L'{s,x) 


L'{s,x) 

L{s,x) 


L{s,x) 


Gsis). 


(9.7) 


Gs{s)x^ds, 


pOO 

G5(1) = M{r]{t)e{St)){l) = / r]{t)e{St)dt = fj{-S). 

Jo 

Let us work out the Laurent expansion of L'{s, x)/L{s, %) at s = 0. By the func¬ 
tional equation (as in, e.g., IIIK04I Thm. 4.15]), 

L'V^x) 1 tt 1/s-bK^^ 1 fl-s + K\ L'{l-s,x) 

where i^{s) = r^(s)/r(s) and 




0 ifx(-l) = l 
1 ifx(-l) = -l. 
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By '0(1 — x) — '0(a;) = TrcotTrx (immediate from r(s)r(l — s) = tt/ sinTrs) and 
0(s) +0(s + 1/2) = 2{0(2s) — log 2) (Legendre; IIAS64I (6.3.8)]), 


( V” 


S + K 


+ Ip 


1 — S + K 


= -0(l-s)+log2+^cot ^^'^^^^ (9.9) 


Hence, unless q = 1, the Laurent expansion of L'{s^ x) at s = 0 is 

1 — tv 


ill ,1,1 


Oi 02 

s 


Here 0(1) = —7, the Euler gamma constant 0AS64I (6.3.2)]. 

There is a special case for q = 1 due to the pole of 0(s) at s = 1. We know that 
0'(O)/C(O) = log27r (see, e.g., IIMV07I p. 331]). 


From this and (9.6 1 , we conclude that, if 77 ( 0 ) = 0, then 
R = 


Co if q > 1 and x(~l) = 
0 otherwise. 


where cq = 0*(|p'(f) logt|i + 27r|5||77(f) logf|i). If p(0) 0 0, then 


i! = ,(0)(log^+7-L/A 


Co ifx(-l) = l 
0 otherwise. 


for g > 1 , and 


for g = 1 . 


R = r]{0) log27r 


It is time to estimate the integral on the right side of (9.7 1 . For that, we will need to 
estimate L'{s, x)/L{s, x) for 3?(s) = —1/2 using (|9.8| and ^3 1 . 

If 3?(z) = 3/2, then \t^ + 0^1 > 9/4 for all real t. Hence, by OOLBCIOI (5.9.15)] 
and ||GR94i (3.411.1)]. 


1 r 

0(z) = iogz- 


tdt 


= log z - — + 2 • (9 
2 z 


= log,-- + -0- 

= iog.-i + Lo- 


0 — 1) 

tdt 


|(e 2 -‘-l) 


tdt 


(27r) 


- 1 


rr( 2 )C( 2 ) 


(9.10) 


1 


2 z 


1 


= \ogz- — +0* — =logz + 0* — 


27 


10 


27 


Thus, in particular, 0(1 — s) = log(3/2 — it) + (9*(10/27), where we write s = 
1/2 + *r. Now 

7r(s + k) 

cot- 

2 


g 7 4 * 2 e* 4 2 

g4F4'*~2''^ — 
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Since 3?(s) = —1/2, a comparison of Dirichlet series gives 

L'{1 - s,x) 


^(1 - s,x) 


|C'(3/2)| 

- |<(3/2)| - 


(9.11) 


where ^^(3/2) and C(3/2) can be evaluated by Euler-Maclaurin. Therefore, (9.8 1 and 
(9.91 give us that, for s = —1/2 + ir, 

L'{s,x) 


L{s,x) 


< 


< 


1 9 

log- 

TT 

log- 

TT 


log 


3 

-+IT 


+ ^+log2 +I+ 1.50524 


+ -log r2 + - +4.1396. 


(9.12) 


Recall that we must bound the integral on the right side of ( |9.7| l. The absolute value 
of the integral is at most times 


27r 


C—k+ioo 


— ^—200 


L'{s,x) 

L{s,x) 


Gs{s) 


ds. 


(9.13) 


By Cauchy-Schwarz, this is at most 


\ 


27r 


p— h+ioo 


— ^ —too 


L'(s,x) ^ 1 
L(s,x) s 


|(is| 


\ 


p—i+200 


27T 


|G5(5)s| \ds\ 


— ^—too 


By (9.121, 


/.-i+ioo 

L'{s,x) 1 

J — ^ —200 

L{s,x) s 


|ds| < 


r-H*- logg 2 


\ 


|ds 


f°° 11 log (t 2 + I) +4,1396 + log TTI 

\ J-oo i + 

< log q + ■\/226.844, 

where we compute the last integral numerically|^ 

Again, we use the fact that, by ( |2.10| , sGs{s) is the Mellin transform of 

d{e{5t)r]{t)) 


-dr 


— t- 


dt 


= —2TriSte{6t)r]{t) — te{St)r]'{t) 


(9.14) 


Hence, by Plancherel (as in (2.6 1 ), 




p— h+ioo 


27r 


— o—200 


\Gs{s)s\^ \ds\ = ^ j \—27riSte{St)T]{t) — te{St)T]'{t)\^t "^dt 

= MtWdt + \'n'{t)\‘^dt. 

(9.15) 


^By a rigorous integration from r = —100000 to r = 100000 using VNODE-LP INed061 , which runs 
on the PROFIL/BIAS interval arithmetic package IKnii99l . 
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Thus, ( |9.13| l is at most 


logq- 


226.844 


27r 


(iTy'la + 27r|(5||?7|2). 


□ 


Lemma 9.1.1 leaves us with three tasks: bounding the sum of Gs{p)x'’ over all 


non-trivial zeroes p with small imaginary part, bounding the sum of Gs{p)xP over all 
non-trivial zeroes p with large imaginary part, and bounding L'(l, x)/L(l, x). Let 
us start with the last task: while, in a narrow sense, it is optional - in that, in the 
applications we actually need (Thm. |7.1.2[ Cor. |7.1.3| and Thm. 7.1.4| l, we will have 
77 ( 0 ) = 0, thus making the term L'{1, x)/L(l, x) disappear - it is also very easy and 
can be dealt with quickly. 

Since we will be using a finite GRH check in all later applications, we might as 
well use it here. 


Lemma 9.1.2. Let xbe a primitive character mod q, q > 1. Assume that all non-trivial 
zeroes p = a it of L{s, x) with |t| < 5/8 satisfy iR{p) = 1/2. Then 




< ^\ogM{q) c, 


where M{q) 


max., 




and 


Proof. By a lemma of Landau’s (see, e.g., IIMV07I Lemma 6.3], where the constants 
are easily made explicit) based on the Borel-Caratheodory Lemma (as in IIMV071 
Lemma 6.2]), any function / analytic and zero-free on a disc Csa,R = {s : |s — so| < 
i?} of radius i? > 0 around sg satisfies 

-W)=°\-'\R-rV j 

for all s with |s — SqI < r, where 0 < r < R and M is the maximum of \f{z)\ on 
C'so.fl. Assuming L{s, x) has no non-trivial zeros off the critical line with 155(3) | < H, 
where H > 1/2, we set Sg = 1/2 -f 77, r = 77 — 1/2, and let R —>■ 77“. We obtain 


L'ifx) 

Hhx) 


(^877 log 


|£(s,x)| \ 

l-^2(so,x)l / 


(9.17) 


Now 


|A(so,x)| > ^ 

p 


(l_p-2.o)-l 
(1 — 


C(23o) 
C(so) ' 


Since sg = 1/2-1-77, Gsq,h is contained in {s € C : 3?(s) >1/2} for any value of 77. 
We choose (somewhat arbitrarily) 77 = 5/8. 
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By partial summation, for s = a + it with 1/2 < cr < 1 and any N G 


x)= xMn ® - I XI 1 (^ + 1) 

\-s+l 


T.<iV 


t<N 


X X I * - (^+1) 

n>N-\-l Ym<n 


(9.18) 


= O* 


+ + M{q)N- 


1 - 1/2 

where M(q) = max„ J2m.<n x(™) ■ set N = M{q)/2>, and obtain 

\L{s,x)\ < 2 M(q)lV-i /2 = 2V3v/mM- 

We put this into (|9.17|l and are done. 


(9.19) 

□ 


Let M{q) be as in the statement of Lem. 9.1.2 Since the sum of x(^) (x uiodg, 
g > 1) over any interval of length q is 0, it is easy to see that M{q) < q/2. We also 
have the following explicit version of the Polya-Vinogradov inequality: 


M{q) < 


r^glogg-f ^^gloglogg-f ifx(-l) = l, 


^V^logg-f ^^log\ogq + ^ 
Taken together with M{q) < q/2, this implies that 

M{q) < q^!^ 


ifx(-l) = 1 - 


for all g > 1 , and also that 

for all g > 1 . 

Notice, lastly, that 


M{q) < 2g3/® 


(9.20) 

(9.21) 

(9.22) 


1 ^TT 

log-h 7 

9 


< log g -f log 


e'^ ■ 2 -k 

32 


for all g > 3. (There are no primitive characters modulo 2, so we can omit g = 2.) 
We conclude that, for x primitive and non-trivial. 


, Stt L'1,x 
log — + 7 - ——- 
9 L{l,x) 


< log ^ + logg + ^ loggs -I- 15.07017 

< 3 log g + 15.289. 


Obviously, 15.289 is more than log27r, the bound for x trivial. Hence, the absolute 
value of the quantity R in the statement of Lemma|9.Ll|is at most 


|?7(0)|(31ogg+ 15.289)+ |co| 


(9.23) 
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for all primitive x- 


It now remains to bound the sum ^ Gs{p)xP in (9.1 1 . Clearly 


'^Gs{p)xP 


<Y.\Gsip)\-x^^^\ 

p 


Recall that these are sums over the non-trivial zeros p of L(s, x)- 

We first prove a general lemma on sums of values of functions on the non-trivial 
zeros of L{s, x)- This is little more than partial summation, given a (classical) bound 
for the number of zeroes N(T, x) of L(s, x) with |3(s) | < T. The error term becomes 
particularly simple if / is real-valued and decreasing; the statement is then practically 
identical to that of ILeh66l Lemma 1] (for x principal), except for the fact that the error 
term is improved here. 

Lemma 9.1.3. Let / : IR+ C be piecewise C^. Assume limt_).oo /(t)flogf = 0. 
Let X be a primitive character mod q, q > 1; let p denote the non-trivial zeros p of 
L{s,x)- Then, for any y > 1, 


5 .= 

+ (l/(y)l5x(2/) + \f'{T)\■ g^{T)dT^ , 


p non-trivial 
Q(p)>V 


(9.24) 


where 


5 x(T) =0.5 log gT+17.7 


(9.25) 


If f is real-valued and decreasing on [y, oo), the second line of {9.24) equals 


O* 



Proof. Write N{T, x) for the number of non-trivial zeros of L(s, %) satisfying |3(s) | < 
T. Write N^{T,x) for the number of (necessarily non-trivial) zeros of L{s,x) with 
0 < 3(s) < T. Then, for any / : M+ — >■ C with / piecewise differentiable and 

lim4_>oo f{t)N{T,x) = 0, 


p:Q(p)>y 


fiT) dN+iT,x) 
f{T){N+{T,x)-N+{y,x))dT 

J 

pOO 

/ f{T){NiT,x)-N{y,x))dT. 


Now, by IIRos41l Thms. 17-19] and IMcC84al Thm. 2.1] (see also OTrul Thm. 1]), 
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for T > 1, where g^iT) is as in (9.25 i. (This is a classical formula; the references 
serve to prove the explicit form (9.251 for the error term g^{T).) 

Thus, for y >1, 




p:S(p)>y 


1^0* ( |/(y)|5x(y) 


|/'(r)|. 5 ^(T)dT . 


(9.27) 


Here 


-1 r i It) =4 r 

If / is real-valued and decreasing (and so, by limt_>,oo /(f) = 0, non-negative), 

poo poo 

\fiy)\9xiy) + / inn ■ g^(T)dT = f{y)g^{y) - / r{T)g^{T)dT 

J y J V 

= 0.5 


m 

T 


dT, 


since g' (T) < 0.5/T for all T > Tq. 


□ 


Let us bound the part of the sum Gs{p)x^ corresponding to p with bounded 
|3(p)|. The bound we will give is proportional to >/To logtfTo, whereas a very naive 
approach (based on the trivial bound \Gs{o- + ir)| < |G'o(cr)|) would give a bound 
proportional to Tq log qTq. 

We could obtain a bound proportional to x/To log qTq for 77 (f) = f*e“‘ G by using 
Theorem |8.0.1 Instead, we will give a bound of that same quality valid for 77 essentially 
arbitrary simply by using the fact that the Mellin transform is an isometry (preceded by 
an application of Cauchy-Schwarz). 


Lemma 9.1.4. Let rj : K)}" —>■ K such that both rj(t) and {\ogt)r]{t) lie in Li n L 2 
and g(t)/s/t lies in Li (with respect to dt). Let 5 S M. Let Gs{s) be the Mellin 
transform of ri{t)e(St). 

Let X be a primitive character mod q, q > I. Let Tq > 1. Assume that all non¬ 
trivial zeros p of L(s, x) with |3(p)| < Tq lie on the critical line. Then 


E nni 

p nan-trivial 
\?S(p)\<To 


is at most 

(hb + \v ■ log| 2 )\/ 7 bloggTo -f (17.21|?7 • logb - (log27rVe)|?7|2)v^ 
p{t)/Vt • (1.32 log g-I-34.5) 


(9.29) 















9.1. A GENERAL EXPLICIT EORMULA 


173 


Proof. For s = 1/2 + ir, we have the trivial bound 

r°° df 

|G 5 (s)| < = r]{t)/Vt 

where Fs is as in (|9.47|l. We also have the trivial bound 


(9.30) 


|G',(s)| = 




r°° df 

Jo ^ 


(9.31) 

for s = cr + ir. 

Let us start by bounding the contribution of very low-lying zeros (|3(p)| < 1). By 
and ( |9^ , 

7V(l,x) = -log;^ -f O* (0.5logg-f 17.7) = O*(0.8191ogg-f 16.8). 

TT zTre 


Therefore, 


^ \G 5 ip)\< • (0.819 log (Z+ 16.8). 

p non-trivial 

|a(p)l<i 


Let us now consider zeros p with |3(/5)| > 1. Apply Lemma 9.1.3 with y = 1 and 

m = 


J|G5(l/2 + zf)| iff < To, 
if f > Tq . 


[0 

This gives us that 

E fmp)) = lJ\{T)\og^^dT 

+ o* (^|/(l)|flx(l) + \fiT)\ ■ 9xiT) dT 


p:l<|St(p)|<To 


(9.32) 


where we are using the fact that / (cr + it) = f{a — ir) (because rj is real-valued). By 
Cauchy-Schwarz, 




f] -nr. 
2 ti J 


Now 

1 

IT 


fTo 1 j-ca 

|/(T)pdT< - / 


Gs[^+iT 


' poo 

dT< \e{6t)T]{t)\'^dt = \r]\l 
Jo 


by Plancherel (as in ( |2.6| l). We also have 

rTo 


qT 


27T 


<}T'o 

2tt f 2 


log 7 ^ dT < — / (logf) df < log -— -1-1 • To. 


q Jo 


qTo 


27re 
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Hence 




Again by Cauchy-Schwarz, 


|/'(r)|. 5 ^(T) dT<.- 


|/'(r)|2dr. / “ |5^(r)|2dr. 


Since |/'(T)| = |G^(l/2 + iT)\ and [MTj)'{s) is the Mellin transform of log(f) 
e{5t)ri{t) (by (|2.10[)), 


1 r°° 


Much as before, 

fTo 


\g^iT)\^dT< / (0.51oggT+17.7)^dT 


= (0.25(logqro)2 + 17.2(loggro) + 296.09)To. 


Summing, we obtain 


1 

TT 


[ °fiT) log gdT + ^ |/(r)| • g^(T) dT 

log + 17.21^ |r 7 (f)(logt)| 2 ^ y/% 


Finally, by ( 9.30| l and ( |9.25| l, 

|/(l)l5'x(l) < • (0.5logg +17.7). 


By (9.32 1 and the assumption that all non-trivial zeros with |5(p)| < Tg lie on the line 
3?(s) = 1/2, we conclude that 

XI \Gsip)\< {\v \2 + \v^og\ 2 )^/%\ogqTo 


p non-trivial 
l<|S(p)|<To 


+ (17.21|77 • log |2 - (log27rVe)|p|2)\/7|) 


+ 


r]{t)/y/i • (0.5 log g + 17.7). 


□ 


All that remains is to bound the contribution to ^ Gs{p)x^ corresponding to all 


zeroes p with |3(p)| > Tq. This will do by another application of Lemma 9.1.3 


combined with bounds on Gs{p) for 5(p) large. This is the only part that will require 
us to take a look at the actual smoothing function rj we are working with; it is at this 
point, not before, that we actually have to look at each of our options for rj one by one. 
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9.2 Sums and decay for the Gaussian 

It is now time to derive our bounds for the Gaussian smoothing. As we were saying, 
there is really only one thing left to do, namely, an estimate for the sum I^i5(p)l 
over all zeros p with |3(p)| > Tq. 

Lemma 9.2.1. Letp^it) = Letx G K'*', 5 S K. Let xbe a primitive character 

mod q, q > I. Assume that all non-trivial zeros p of L{s, x) with |3(p)| < Tg satisfy 
5ft(s) = 1/2. Assume thatTg > 50. 

Write Fs{s) for the Mellin transform of rj{t)e{5f). Then 

V \Fsip)\ < log^ • ■ 

p 27r V Jg J 

mp)\>To 

Here we have prefe rred to give a bound with a simple form. It is probably feasible 
to derive from Theorem 8.0.1 a b ound essentially prop ortio nal to where p = 

Tq/{'k5)'^ and E[p) is as in ( |8.2| l. (As we discussed in (8.5 E[p) behaves as 
for p large and as for p small.) 

Proof. First of all. 




p 

mp)\>To 


P 

a(p)>To 


by the functional equation (which implies that non-trivial zeros come in pairs p, 1 — p). 
Hence, by a somewhat brutish application of Cor. |8.0.2| 


E \Fs{p)\< E 


p 

mp)\>To 


p 

9(p)>To 


(9.33) 


where 


/(r) = 3.001e"°-^“®(^?) + 3.286e"°-^®®®l^l. 

Obviously, fir) is a decreasing function of r for t > Tq. 

We now apply Lemma 9.1.3 We obtain that 


(9.34) 


L + ms, 

S(p)>To 

We just need to estimate some integrals. For any y > 1, c,ci > 0, 

poo yo \ / 




*dt 


1 


■ Cl 


-dt 


(iogj/)e 


1 

- + Cl 

c 


c 


Ei{cy), 
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where Ei{x) = e *dt/t. Clearly, Ei{x) < e *dt/x = e ^/x. Hence 


•^v 

We conclude that 

pOO 

/ ^-0.1598t 


(\ogt+ y) e < \ \ogy+ f - + ci j - 


i\ e-‘^y 

y, 


'To 


1 , 7^ 1 

— log-- 

2tt ^ 2tt At 


dt 


< 


1 

2tt 

1 


'To 


logC 


/2\„- 




27rc 


e-^*dt (9.36) 


27rc 


= ?;— log^o + log — + - + - — e 


27r 


1 7r\ 1 


2/ Tn 


3 —cTo 


with c = 0.1598. Since Tq > 50 and q > 1, this is at most 

1.0721og^e"^^«. 

ZTT 

Now let us deal with the Gaussian term. (It appears only if Tq < {3/2){tt6)‘^, as 
otherwise |t| > (3/2)(7rd)^ holds whenever |r| > Tq.) For any y > e, c > 0, 


(9.37) 


1 


e-"* dt=^ 
VC. 


1 


e ^ dt < — 


te ^ dt < 


Vcy 


Cy J y/cy 


2cy 


t 


-dt = / --—dt = 


Eiicy"^) . e 


Hence 


'^y 

(logf)e"°*" 

/OO „ 

g-0.1065(^) 


I cy^ 


2t 


< 


2cy'^ 


log f — 1 \ 


log ‘" 2^2 ^ ' e = 


2cy 


'To 


^log^V^ 

27r 2tt at 


dT 


i: 

y ¥T7T 

/ l^l 


-o.ioesr 


2 f\S\ q\S\t 1 


2 ‘“<5 2 


4f 


dt 


< 


f log;^ , f loggM 


V 


_ ^1 

Og/ To 
^\S\ 


Og' To 


O V|i| > 


Sd 


(T^y 


with d = 0.1065. Since Tq > 50 and q> 1, 


StT TT 1 , oTo 

- < - < 0.0152 • - bg — 

8T0 - 200 - 2 ^ 27r 


Thus, the last line of (|9.41|i is less than 


1.0152 ^ = 7.487^ ■ bg ^ 

iF|ir ° 


(9.38) 

(9.39) 

(9.40) 


(9.41) 


(9.42) 
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Again by Tq > 4Tr^\S\, we see that 1.00 577 r|( 5 |/( 4 cTo) < 1.0057/(16c7r) < 0.18787. 

To obtain our final bound, we simply sum (|9.37| and (|9.42|l, after multiplying them 
by the constants 3.286 and 3.001 in ( |9.34| l. We conclude that the integral in ( |9.35| l is at 
most 

+22.5 —log—. 

V 1^0 J 2tt 


□ 


We need to record a few norms related to the Gaussian r]<y{t) = before we 

proceed. Recall we are working with the one-sided Gaussian, i.e., we set ri(y{t) = 0 
for f < 0. Symbolic integration then gives 






H,\l = r{te-''r^fdt = ^. 


\v^ • log |2 = / e ‘ (logtfdt 


= — (tt^ + 27 ^ + 87 log 2 + 8 (log 2)2) < 1.94753, 
16 


(9.43) 


\r]:y{t)/Vi\i = 
|77^(f)/Vf| = \v^{t)Vi\i = 








1.03045 


e-^t^dt = 1.07791. 


(9.44) 


We can now state what is really our main result for the Gaussian smoothing. (The 
version in §7.1|will, as we shall later see, follow from this, given numerical inputs.) 


Proposition 9.2.2. Let 77 (f) = e~* /2_ Letx > 1, 5 € M. Let xbe a primitive character 
mod q, q > I. Assume that all non-trivial zeros p of L{s, x) with |9(p)| < Tq lie on 
the critical line. Assume that Tq > 50. 

Then 


00 

^ A( 7 r)x(n)e 

n—1 



where 


ri{—5)x + O* (err^_^((5, x)) ■ x ifq = 1, 

O* (err,j_^(5, a;)) • x ifq > 1, 

(9.45) 


err, x) = log ^ (3.53e-°-i^®«^“ + 

2 tt \ Jo / 

+ (2.337\/7^log gTo + 21.8175/7(^ + 2.85 log < 7 + 74.38)x" 5 

+ (31ogq + 14|5| + 17)a;“^ + (logg + 6 ) • (1 + 5|(5|) • x~^^'^. 
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Proof. Let ^^(s) be the Mellin transform of By Lemmas 9.1.4 (with = 

Fs) and Lemma [9.2.1[ 

p non-trivial 


is at most (9.29 1 (with rj = times ^/x, plus 


log 




27r 


Tn 


By the norm computations in ( |9.43| l and ( |9.44| l, we see that ( |9.29 1 is at most 
2.337y/fo log qTo + 21.817+ 2.85 log q + 74.38. 


Let us now apply Lemma [9.Ll| We saw that the value of R in Lemma 9.1.1 is 
bounded by ( 9.23[ ). We know that 7y(;p (0) = 1. Again by (9.43 i and (9.44 1 , the quantity 
Co defined in (9.3 i is at most 1.4056 + 13.3466|(5|. Hence 


Lastly, 

Clearly 


|i?| < 3logg + 13.347|(5| + 16.695. 

|p^|2 + 27r|(5||p.^|2 < 0.942+ 4.183|<5| < 1 + 5|<5|. 


(6.01 - 6) • (1 + 5|(5|) + 13.347|5| + 16.695 < 14|5| + 17, 
and so we are done. 


□ 


9.3 The case of r]^{t) 


We will now work with a weight based on the Gaussian: 


r]{t) 


f2e-‘V2 ifi>o, 
0 if f < 0. 


(9.46) 


The fact that this vanishes at f = 0 actually makes it easier to work with at several 
levels. 

Its Mellin transform is just a shift of that of the Gaussian. Write 

Fs{s) = {M{e-^e{St))){s), 

Gs{s) = {M{r]{t)e{6t))){s). 

Then, by the definition of the Mellin transform, 

Gs{s) = Fs{s + 2). 

We start by bounding the contribution of zeros with large imaginary part, just as 
before. 
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Lemma 9.3.1. Letr]{t) = Letx S K'*', (5 G K. Let x be a primitive character 

mod q, q > Assume that all non-trivial zeros p of L{s, x) with |9(/9 )| < Tq satisfy 
?R.{s) = 1/2. Assume thatTg > inax(107r|(5|, 50). 

Write Gs{s) for the Mellin transform of r]{t)e{dt). Then 

\Gs{p)\ < ^olog|^ • . 

p \ 2 

mp)\>To 

Proof We start by writing 

E |G'^(9)I= E {\Fs{p + 2)\ + \Fs{{1-p) + 2)\), 

p p 

|S(p)|>To a(p)>To 


where we are using Gs{p) = Fs{p + 2) and the fact that non-trivial zeros come in pairs 
pA- P- 

By Cor. 8.0.2 with k = 2, 


E \Gs{p)\< E 

p p 

|a(p)l>To S(p)>To 


where 


/(t) = 




/v 2 ,i T e 


-0.1598|t| 




(9.48) 


where K 2.0 = 7.96 and K 2 x = 5.13. We are including the term in both 

cases in part because we cannot be bothered to take it out (just as we could not be 
bothered in the proof of Lem. 9.2.1 1 and in part to ensure that /(t) is a decreasing 
function of r for r >To. 


We can now apply Lemma[9.1.3| We obtain, again. 


E < r ( 

n ■'T'o V 


P 

Kp)>To 


1 1 qT 


1 

LT 


dT. 


(9.49) 


Just as before, we will need to estimate some integrals. 
For any y > 1, c, ci >0 such that log y > 1/(cy). 


te 


-Ut = ( ^ + 4 


(flog 


t + 


Cl 


^dt < 


t + 


a — 1 


logt - 


a 

cH 




^dt 


(9.50) 
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where 


a = 


logy 1 £1 

c ' c ' y 


logy 


1 

c^y 


Setting c = 0.1598, ci = 7r/2, y = Tq > 50, we obtain that 


/:( 




+ + logTo) 


27r 


27r 


(9.51) 


„-0.1598To 


and 


1 V2 


logJo 

0.1598 ' 0.1598 ' Tq ^ 299 

log To 1 — ' ■ 

0.1598 


0.15982To 

It is easy to see that ratio of the expression within parentheses on the right side of 
(9.51 1 to Tq log((7To/27r) increases as q decreases and, if we hold q fixed, decreases as 
To > 2tt increases; thus, it is maximal for <7 = 1 and Tq = 50. Multiplying ( 9.51| l by 
K 2.1 = 5.13 and simplifying by the assumption Tq > 50, we obtain that 


'To 


5.13Te-°-i^9«^ ( ^ log ^ ) dT < 6 .IIT 0 log • e 


qTq 

2tt 


1 

4T 


QTq ^ 

.1598To 
27r 

(9.52) 

Now let us examine the Gaussian term. First of all - when does it arise? If Tq > 
( 3 / 2 )( 7 rd)^, then |r| > ( 3 / 2 )( 7 r( 5 )^ holds whenever |r| > Tq, and so 
give us a Gaussian term. Recall that Tq > 107r|(5|, which means that 
implies that Tq > (3/2) We can thus assume from now on that 

since otherwise there is no Gaussian term to treat. 

For any y > 1, c,ci > 0, 


9.48 1 does not 


<5| < 20 /( 37 r) 
5| > 20/(37r), 


t^e dt < 


f + 


1 


dt = 


y 


1 


pOO pOO / 

/ log t + cif) • dt< log t + 

Jy Jy \ 


2c 

at log et log et 


2c 


where 


( 2 cy + a)logy + a .^^2 
4c2 ■ ® ’ 


2 c 


4c2f J 


dt 


ciy+^ 


y log ey 

2c 


1 

4c^y 


ciy ■ 


Ac^y^ 


1 ^ 2CiC 

IHsEIirZ'^ - „ + log ey 

2c 4c2y ^ ^ 


2cy log ey 4c^y^ 


y log ey 

2c 


I 

4c2y 


(Note that a decreases as y > yo increases, provided that log eyo > l/(2cyo)-) Setting 
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c = 0.1065, Cl = 1/(2|(5|) < 3/16 and y = To/(7r|(5|) > 47r, we obtain 

1 


1 q\6\t 

jUl V 27r 2 4Tr\6\t 


^2g-0.10654' 


dt 


< 




1 


1 


\27rc|(5| 4c^ • 10^ 

f2c-^ + a) log +a / Tn \2 

^ V ^\°\ ) ^ ^l°l ^ -0-1065(^) 


=-0-1065(:^)" 


4c2 


and 


1 

a < — 
- 10 


' 2-20 
>. Stt 


, -1 


10 - 


10 los lOe 


4 0.1065^-10^ < 0 117 


2 0.1065 4-0.10652-10 

Multiplying by (K2,o/4)7r|5|, we get that 

2 


^2,0 


'To 


-0.1065(^)^ {^ ,_97’o 1 

r|^| ■ 


271 271 ' 4T 


dT 


IS 


-0.1065('^')^ 

at most e times 


(1.487To + 2.194|J|) • log ^ + 1.487To log ^ + 2.566|(5| log ^ 

2 7r\d\ 7r\d\ 


< 1.487 + 2.566 


iii+ljj-.logf < 1.578.Tolog 


g^O 
271 ’ 


(9.53) 


(9.54) 


where we are using several times the assumption that Tq > 471^|^| (and, in one occa¬ 
sion, the fact that |(5| > 20 /(Sti) > 2). 

We sum (|9.52|i and the estimate for (|9.53|l we have just got to reach our conclusion. 

□ 


Again, we record some norms obtained by symbolic integration; for y as in (9.461, 


|9 ■ log 1+ ^ ( 8(37 - 8 ) log 2 + 371^ + 67 ^ + 24(log 2f + 16 - 327 ) 

< 0.16364, 

|77(t)/v^|i = ^ 1 07791, \v{t)Vi\i = ^23/4r(3/4) < 1.54568, 

p \/^ 2 ^00 2 

|r7'(f)/Vf|i = / / f3/2g-Vf^i < 1 48469^ 

Jo J y /2 

l9'(f)+i < 1.72169. 


(9.55) 
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Proposition 9.3.2. Let rj{t) = Let a: > 1, <5 G M. Let x be a primitive 

character mod q, q > 1 . Assume that all non-trivial zeros p of L{s,x) wif/z |9(p)| < Tq 
lie on the critical line. Assume that Tq > max(107r|(5|, 50). 

Then 


n—1 

where 


^ A(n)x(n)e ( -n j pin/x) = 


fj{—6)x + O* (err^_^((5, a;)) • x if q = 1, 

O* (eiTy^^lS, x)) ■ X if q > 1, 

(9.56) 


qTo 


erVrjM, x) = To log ^ • ( 6.11e"°-^^®®^'> + 1.578e 

ZTT ' 


- 0 . 1065 -^^ 


+ (l.22\/i)^logq^o + 5.056^/i)^+ 1.4231ogg +37.19^ • 
+ (3 + ll|i5|)x“^ + (logq + 6 ) • (1 + 6|<5|) • 


Proof We proceed as in the proof of Prop. 9.2.2 The contribution of Lemma 9.3.1 
qTo 


(9.57) 
is 


To log 


27r 


6,iig-0.1598To^l_578g-0.1065.^ 


whereas the contribution of Lemma l9.1.4l is at most 

{\.22s/¥o log qTo + 5.056\/71) + 1.423 log q + 37.188)y^. 
Let us now apply Lemma [9.Ll| Since 77 ( 0 ) = 0, we have 
R = O*(co) = 0*(2.138 + 10.99|5|). 

Lastly, 

Ir^'la +27r|(5||77|2 < 0.881 + 5.123|(5|. 


□ 


Now that we have Prop. 9.3.2 we can derive from it similar bounds for a smoothing 
defined as the multiplicative convolution of p with something else. In general, for 
Ti I T 2 : [0, 00 ) —>■ C, if we know how to bound sums of the form 


f{n)ipi{n/x), 


(9.58) 


we can bound sums of the form simply by changing the order of summation 

and integration; 


Sf,V>l*MV2 = XI T 2 ) (^) 

71 

./n ^ \WXy 


dw 

w 


Sf,ipt{wx)(p2{w) 


dw 

w 

(9.59) 
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This is particularly nice if (p 2 (t) vanishes in a neighbourhood of the origin, since then 
the argument wx of {wx) is always large. 

We will use ipi{t) = t^e~* = 9 i *m 9 i, where 771 is 2 times the char¬ 

acteristic function of the interval [1/2,1]. The motivation for the choice of (pi and ip 2 
is clear: we have just got bounds based on (pi (t) in the major arcs, and we obtained 
minor-arc bounds for the weight ip 2 {t) in Part[^ 

Corollary 9.3.3. Let r]{t) = ryi = 2 • /[i/ 2 ,i], 92 = 9 i *m Vi- V* = 

772 *M V- (5 G K. Let X be a primitive character mod q, q > 1- Assume 

that all non-trivial zeros p of L{s, x) with |9(p)| < Tq lie on the critical line. Assume 
that Tq > max(107r|(5|, 50). 

Then 


= 1 , 

> 1 , 

(9.60) 

err^,x.(<5:2:) = Tolog^ • + 0.0102 • 

+ ^1.679\/T[) logqTo + 6.957\/Tj) -I- 1.9581ogg -f 51.17^ • x~^ 

-f (6 -f 22\5\)x-^ + (logg + 6) • (3 -f 17|<5|) • 

(9.61) 


^ A{n)x{n)e f -n j 77 * {n/x) = 

n—1 ^ ' 


Tft{—5)x + O* {eYVT^^^^{5,x)) ■ X ifq 
O* {eTrr,,^y(^{S,x)) ■ X ifq 


where 


Proof The left side of ( 9.60| l equals 

/ ^ Hn)x{n)e (—\ p i—') p2{.w) 

Jo \ ^ / ^wxy 


dw 

w 


[ ^Hri)xiri)e(^^^ri(—)ri2{w) — , 
Ji V / \wxJ w 

4 n—1 ^ 


since 772 is supported on [—1/4,1]. By Prop. 9.3.2 the main term (if g = 1) contributes 
[ rj{—5w)xw ■ ri 2 {w)—=x [ p{—Sw)r] 2 {w)dw 

J\ w Jo 

= x / rj{t)e{5wf)dt ■ ri 2 (w)dw = x / / r]{ — ]e{Sr) — r] 2 {w)dw 

Jo 3-00 Jo 3-00 w 

I (y ^ (“) e(( 5 r)dr = ?^(- 5 ) • X. 


= X 


The error term is 


wx) ■ wx ■ 772 ( 77 ;) 


dw 

— = X ■ 

w 


erijj^^{Sw, wx)rj 2 {w)dw. 


(9.62) 


1 


1 
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Using the fact that 


41og4t(; if w G [1/4,1/2], 

ri2{w) = ■^ 4:\ogw~^ if w G [1/2,1], 

0 otherwise, 


we can easily check that 

POO 

/ 'r]2{w)dw = 1 , 

^0 

poo poo 

/ w~^r] 2 {w)dw = 4(log2)^ < 1.92182, / w~^^‘^r] 2 {w)dw < 2.74517 

Jo Jo 


POO 

/ 'w~^^'^rj2{w)dw < 1.37259, 

Jo 


and, by rigorous numerical integration from 1/4 to 1/2 and from 1/2 to 1 (using, e.g., 
VNODE-LP BNedOhl ). 



< 0.006446. 


We then see that ( |9.57| l and ( |9.62| l imply ( |9.61| l. 


□ 


9.4 The case of ?7+(i) 

We will work with 

r]{t) = r]+{t) = hH{t) ■ tricy{t) = hnit) ■ (9.63) 


where hn is as in (7.6i. We recall that hn is a band-limited approximation to the 
function h defined in (7.5 1 - to be more precise, MhH{it) is the truncation of Mh{it) 
to the interval [—77, 77]. 

We are actually dehning h, hn and ry in a slightly different way from what was done 
in the hrst version of IHelal . The difference is instructive. There, ry(f) was dehned as 
and hu was a band-limited approximation to a function h defined as in 
(7.51, but with f^(2 — t)^ instead of — t)^. The reason for our new dehnitions is 
that now the truncation of Mhiit) will not break the holomorphy of Mr], and so we 
will be able to use the general results we proved in §9.1] 

In essence, Mh will still be holomorphic because the Mellin transform of trj<y{t) is 
holomorphic in the domain we care about, unlike the Mellin transform of (f), which 
does have a pole at s = 0. 

As usual, we start by bounding the contribution of zeros with large imaginary part. 
The procedure is much as before: since ry+(t) = VH{t)r]<y{t) , the Mellin transform 
Mr]+ is a convolution of M(te~^ and something of support in [—77, 77]i, namely, 
MrjH restricted to the imaginary axis. This means that the decay of Mri+ is (at worst) 
like the decay of delayed by 77. 
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Lemma 9.4.1. Let rj = rj.^. be as in (9.63 \for some H > 25. Let x G 


5 G K. Let 


Xbe a primitive character mod q, q > 1. Assume that all non-trivial zeros p of L(s, x) 
with |3(/3)| < To satisfy 3?(s) = 1/2, where Tq'> H + inax(107r|(5|, 50). 

Write Gs{s) for the Mellin transform of r]{t)e{dt). Then 


^ (^11.308v^e-°-i598^o + 16.147|<5|e j log 

mp)\>To 

where T/ = Tg — H. 

Proof. As usual, 


E \Gs{p)\= E (|G^(p)| + |G,(l-p)|). 


mp)\>To 


P 

^(p)>To 


Let Fs be as in (9.47 1 . Then, since r]o^{t)e{5t) = ^ /^e((5t), where hn is as 

in we see by ^2.9^ that 

1 

^s{s) = — / Mh{ir)Fs{s + 1 — ir)dr, 

27r 

and so, since \Mh{ir)\ = \Mh{—ir)\, 


i-H 


|G5(p)| + |G5(i — p)\ < 


27r 


'-H 


\Mh{ir)\{\Fs{l + p-ir)\ + \Fs{2-{p-ir))\)dr. 

(9.64) 

We apply Cor. 8.0.2 with k = 1 and Tq — H instead of Tq, and obtain that \Fs{p) \ + 
1^/(1 - p)\ < gir), where 


9ir) = 


(9.65) 


where /ci o = 4.903 and nip = 4.017. (As in the proof of Lemmas 9.2.1 and 9.3.1 
are putting in extra terms so as to simplify our integrals.) 

From (|9.64|i, we conclude that 


we 


\Gs{p)\ + \Gsil-p)\<f{T), 


for p = a + ir, T > 0, where 




\Mh{ir)\i 

2tt 


■g{T-H) 


is decreasing for t > Tn (because p(r) is decreasing for t > Tn — 
\Mh(ir)\i < 16.193918. 


H). By (A.17I, 
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We apply Lemma 9.1.3 and get that 


\mp)\>To 


'To 

\Mh{ir)\i 

27r 


E \GM\< i + 


27r 2tt at 




(9.66) 


Now we just need to estimate some integrals. For any y > e^, c > 0 and k, ki > 0, 




,-cy 


I ('^ ‘“S'*+^ +' 


„-cy 


where 


1 1 + CKi 

a = :^ + 


2 log(j/ + k) ■ 

The contribution of the exponential term in ( |9.65[ ) to ( |9.66| l thus equals 

Ki^i\Mh{ir)\i 


27r 


< 10.3532 




iTo-H 




-0.1598T 


dT (9.67) 


< 


10.3532 /y/To - H 

27r V 0.1598 ' 0.1598V7o - H 


+ 


log^ . g-0.1598(To-r/)^ 
27r 


where a = l/2 + (l+0.15987r/2)/logTo. Since To-i7 > hOandTo > 50+25 = 75, 
this is at most 

11.308\/ro - Hlog ^ • e-o i598(To-ff)^ 

27r 

We now estimate a few more integrals so that we can handle the Gaussian term in 
( 9.65| l. For any y > 1, c > 0, k, ki > 0, 





2 


2c 


(tlog(f + k) + Ki)e df < 



ylog{y + K) j 


log{y + k) ■ e 


Proceeding just as before, we see that the contribution of the Gaussian term in (|9.65|l 
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to ( |9.66| l is at most 


27r 


< 12.6368 


< 12.6368 


'To 


l<5| 


1 , qT 
. flog (t- 


fFTTT 


1 

4T 

H 

7r|5| 


T-H 

27r|i5| 


• e 


-0.1065(5^)' 


dT 


iog*a + ?i/? 

® 2 T 


Te 


-0.1065T" 


dT 


1-51 


6-51 


8-0.1065 


1 


2-0.1065-(To-//) 


Tq-H 1 To 

508 ttIM 




7i-| 5| 7r|5| 

Since (Tg — H)/(tt\S\) > 10, this is at most 




(9.68) 


16.147|5|log^-e-°i“K^)'. 

2tt 


□ 


Proposition 9.4.2. Let r] = r]+ be as in {9.63 )for some H > 25. Let x > 10^, (5 S K. 
Let X be a primitive character mod q, q > 1. Assume that all non-trivial zeros p of 
L{s, x) wdh |9(p)| < To lie on the critical line, where Tq > H + max(107r|(5|, 50). 
Then 

rif{-5)x + O* (err^^,x((5,a;)) -x if q = I, 

ifq > I, 
(9.69) 


5: A(n)x(n)e ^-n\p^{n/x) = ^ 

VT y [O* {eTTrj^,^{S,x)) 

where 

= ( 11.308v/7^ • + 16.147|(5|e 


-0.1065 ( -4 


log 


gTp 

271 


+ (1.634 log gTo + + 1.321 log q + 34.51)a;^/^ 

+ (9 + ll|<5|)a;-^ + (logg)(ll + 6|5|)x-3/2, 


(9.70) 

where Tq = Tq — H. 

Proof. We can apply Lemmas 9.1.1 and Lemma [9.1.4 because {t), (log t)ri^ (t) and 
are in £2 (by (A.25i, ( A.28|) and (|A.32 1 ) and and are in 

ii for a in an open interval containing [1/2,3/2] (by ( A.30| l and (A.33 1 ). (Because of 
(9.5 1 , the fact that ?7_|_(f)t“5/2 r^_|_(t)t5/2 are in £1 implies that q+{t) logf is also in 

£ 1 , as is required by Lemma 9.1.4 ) 

We apply Lemmas 9.1.1 9.1.4 and |9.4.ll We bound the norms involving 77 + using 
the estimates in [A.3 and [A.4 Since p+(0) = 0 (by the definition (A.3i of 77 +), the 
term R in \9.2\ is at most cq, where cq is as in (|9.3|l. We bound 


Co < ^ (2.922875 (^V^{l/2) + A/r(3/2)) + 1.062319 (sjT{6/2) + y'Til 12) 
y |(5| • 1.062319 (v/r(3/2) + Vr(5/2)) < 6.536232 + 9.319578|^| 
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using (A.30 1 and ( A.33| l. By (A.251, ( A.32| l and the assumption H > 25, 
b+b < 0.80365, Ip^b < 10.845789. 

Thus, the error terms in ( |9.1| l total at most 

6.536232+9.319578|,5| + (logg + 6.01)(10.845789 + 27r • 0.80365|(5|)a:-i/2 
< 9 + 11|5| + (logg)(ll + &\5\)x-^/^. 

(9.71) 

The part of the sum ^ Gs{p)x'’ in (9.1 1 corresponding to zeros p with |3(p)| > 
To gets estimated by Lem |9.4.r| By Lemma [9.1.4| the part of the sum corresponding 
to zeros p with |5(p)| < Tq is at most 

(1.634\/71) log gTo + 12.43\/71) + 1.321 log q + 


where we estimate the norms [p+b, |?? • log b \rj{t)/\/t\i by (A.25l, (A.28i and 

( |a3o1 ). □ 

9.5 A sum for ? 7 +(t)^ 

Using a smoothing function sometimes leads to considering sums involving the square 

we will need a result involving rj^ 


III 


of the smoothing function. In particular, in Part 
- something that could be slightly challenging to prove, given the way in which is 
defined. Fortunately, we have bounds on |p+|oo and other £oo-norms (see Appendix 


A.5 I. Our task will also be made easier by the fact that we do not have a phase e{6n/x) 
this time. All in all, this will be yet another demonstration of the generality of the 


framework developed in (9.1 


Proposition 9.5.1. Let p = rj^ be as in f 9.631, H > 25. Let x > 10®. Assume that 
all non-trivial zeros p of the Riemann zeta function C,{s) with |5(p)| < Tg lie on the 
critical line, where Tq > max(2i7 + 25,200). 

Then 


OO 

^ A(n)(logn)? 7 ^(n/a::) = a; • / p\{f) \ogxt dt + O*■ x logx, (9.72) 

_1 Jo 

logTi\ 


n—1 

where 


err^, ..=( ( 0.462 + 0.909 logTi^ Ti + 1.71 (l + , , 

VV log2: / V log*/ 

+ log To + 50.04) • 

and Ti = To — 2H. 

The assumption Tg > 200 is stronger than what we strictly need, but, as it happens. 


H 


(9.73) 


we could make much stronger assumptions still. Proposition |9. 5.1 relies on a verifica¬ 
tion of zeros of the Riemann zeta function; such verifications have gone up to values 
of To much higher than 200. 
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Proof. We will need to consider two smoothing functions, namely, 77 +o(f) = ?7+(f)^ 
and 77+^1 = 77 +(<)^ logf. Clearly, 


^ A(n) (log 77 ) 77 +(n/a;) = {^ogx)'^K{n)'q+^Q{n/x)+ '^K{n)r]+,i{n/x). 


Since 77 +(f) = hH{t)te 

V+Ar)=hjj{t)t^e-*^ 


V+Ar) = /7^(f)(logf)f2e- 


Let 77+^2 = (loga;) 77 +,o + V+, i = vl {t)logxt. 

We wish to apply Lemma 9.1.1| For this, we must first check that some norms are 
finite. Clearly, 


V+,2 (t) = A (t) log x + 77^ (t) log t 

9 +. 2 A = ‘^9+{tW+A logx + 2 r]+{t)A{t) logf + vl{t)/t. 


(9.74) 


Thus, we see that r]+, 2 {t) is in £2 because 77 +(f) is in £2 and 77 +(f), 77 +(f) logf are both 
in £00 (see ( |X25] l, ( | 08 ] l, (AA^ )- 


|77+.2(t)l2 < |7+W|2loga;+ |77^(f)logf|2 

< I 7 +I 00 l 7 +l 2 loga;+ | 7 ?+(f)logf|^ | 77 +|. 


(9.75) 


Similarly, rjf 2 (f) is in £2 because 77 +(f) is in £ 2 , Ti+{t) is in £2 (A.32i, and 77 +(f), 
77 +(f) logf and 77 +(f)/f (see ( A.41| i) are all in £ac,: 

|77+,2W|2 ^ |277+(i)77+(f)|2loga;+ |277+(f)77(^(f) logfj^ + 177^ (f)/f|2 

< 2|77+|^ |77(^|2loga; + 2|77+(f) logt|^ \A\^ + \v+it)/t\^ \r]+\.^. 

(9.76) 

In the same way, we s ee tha t rj+^ 2 {t)t'^~^ is in £i for all a in (—1, 00 ) (because the same 
is true of 77 +(f)f'^“^ (A.301, and rj+{f), r]+(t) logf are both in £^ 0 ) and r]f i s 

in £i for all a in (0, 00 ) (because the same is true of rj+{t)P’~^ and (A.331, 

and 77 +(f), 77 +(f) logf, 77 _|_(f)/f are all in £ 00 ). 

We now apply Lemma 9.1.1 with g = 1, i5 = 0. Since 77 + 2 ( 0 ) = 0, the residue 
term R equals cq, which, by (9.74 1 , is at most 2/3 times 

2 (b+loo loga; + | 77 +(f) log<|^) ( V+{i)/Vi ^ 


+ 


\v+{t)/t\oo ( V+{t)/Vi ^ + il+it)Vi 


Using the bounds (A.38i, (A.40l, (A.41 1 (with the assumption H > 25), (A.30i and 
( A.33|l, we get that this means that 


Co < 18.57606 log X + 8.63264. 
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Since q = 1 and J = 0, we get from (9.76i (and (A.381, (A.40i, (A.41 1 , with the 


assumption H > 25, and also (A.25 i and (A.32 1 ) that 


(logg + 6.01)- (\v+, 2\2 + 27^1^1 b+, 212 ) X 

= 6.01< (162.561oga; + 59.325)a;-^/^ 

Using the assumption x > 10®, we obtain 

Co + (185.261oga; + 71.799)a;“^^^ < 19.064logcc. 


(9.77) 


We will now apply Lemma [9.1.4| - as we may, because of the finiteness of the norms 
we have already checked, together with 

l?7+.2Wlogt|2 < (i)logf|2loga;+ 

< \r]+{t)\ogt\^ (|77+(f)|2logx+ |?7+(f)logf|2) 

< 0.4976 • (0.80365 log x + 0.82999) < 0.3999 log x + 0.41301 

(9.78) 

(by (A.40 1 , (A.251 and (|A.28|i; use the assumption H > 25). We also need the bounds 


|??+. 2 (f)l 2 < 1.141991ogx +0.39989 (9.79) 

(from ( 9.75| l, by the norm bounds (A.38 1 , (A.40 1 and (A.25 1 , all with H > 25) and 


77+,2(f)/Vf ^ < (|??+(f)|^logx +|? 7 +(f)logf|^) r]+{t)/Vi 
< 1.4211 logx +0.49763 

(by ( A.38| l, (A. 4 O 1 (again with H > 25) and ( A.30| l). 


(9.80) 


Applying Lemma 


9.1.4 


we obtain that the sum ^ |G'o(p)|x'’ (where Go{p) = 


Mr7+ 2 (p)) over all non-trivial zeros p with |3(p)| < Tq is at most times 

(1.54189 log X + 0.8129) v/T)) log To + (4.21245 log x + 6.17301)^/7}) 

+ 49.1 logx + 17.2, 


(9.81) 


where we are bounding norms by \9.19\ , ( |9.78| l and ( |9.80[ ). (We are using the fact that 
To > 2TTy/e to ensure that the quantity /Tq log Tq — (log 2'K^/e)/TQ being multiplied 
by |p+, 2|2 is positiv e; thus , an upper bound for |g+, 2|2 suffices.) By the assumptions 
X > 10®, To > 200, ( 9.8l| ) is at most 


(2.445 \/7)(log To + 50.034) logx. 

In comparison, 19.064x“^/^ logx < 0.002logx, since x > 10®. 

It remains to bound the sum of M 77 + 2 (p) over zeros with |5(p)| > Tq. This we 
will do, as usual, by Lemma 9.1.3 For that, we will need to bound M?7_|_ 2 (/o) for p in 
the critical strip. 
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The Mellin transf orm o f e is r(s/2)/2, and so the Mellin transform of t^e 
is r(s/2 + l)/2. By (2.10i, this implies that the Mellin transform of (logf)f^e“* is 
r(s/2 + l)/4. Hence, by (I^, 


where 


1 

M77+,2(s) = — / M{h%){ir) ■ F^{s - ir)dr, 

47r 

Fx(s) = (loga;)r (2 + + 2 ^^ (2 ’ 


(9.82) 


(9.83) 


Moreover, 


1 1 '°° 

M{h?u){ir) = — / MhH{iu)MhH{i{T — u)) du, (9.84) 

J — 00 

and so M{h?^){ir) is supported on [—2iT, 2iT]. We also see that \M h%f{ir)\i < 
\MhH{ir)\\/2TT. We know that \MhH{iT)\\/2'K < 41.73727 by (A.17 1 . 

Hence 


1 

|M?7+, 2 ( 5)1 < — / |M(/i^)(ir)|dr- max |T;(s-ir)| 

47r \r\<2H 

41.73727 

< -• max 1771(5 — ir)| < 3.32135 • max \Fx(s — ir)\. 

An \r\<2H' ^ \r\<2H' ^ 


(9.85) 


By (8.51|l (Stirling with explicit constants). 


|r( 5 )| < + 

when 3?(s) > 0, and so 

|r(s)| < ^ ^ 12 ^ ^ ^ 


(9.86) 


(9.87) 


< 2.542|S5(5)|e 




for 5 e C with 0 < 5R(5) < 3/2 and |5(5)| > 25/2. Moreover, by OOLBClOl 5.11.2] 
and the remarks at the beginning of OOLBClOl 5.1 l(ii)]. 


1 


1 


r(5) 25^^ V12|5P COS3 0/2 

for I arg(s)| < 9 (6 G (—7r,7r)). Again, for 5 = cr + ir with 0 < cr < 3/2 and 
|t| > 25/2, this gives us 


1 


1 


1 


^-logW + iogLELF^ + o'l^^j+o 


= logW + 0-(^ + ^ 

, , , _ /0.609\ 

= iogM + o- 


O*(0.236) 
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Hence, for 0 < 3fi(s) < 1 (or, in fact, —2 < 3?(s) < 1) and |3(s)| > 25, 


\Fxis)\ < ( (logo;) + 2 log 


O* 


0.609 

VM 


r 


(I-) 


< 2.542((loga;) + ^ log |t| - 0.297) 


(9.88) 


Thus, by (9.85 1 , for p = a + ir with |t| > Tq > 2H + 25 and 0 < cr < 1, 

\Mr]+^2ip)\ < fir) 

where 


f(T) = 8.45 ( log a; + i log T 


-H] - e 




(9.89) 


The functions 1 1 —>■ and 1 1 —^ (logf)fe“'^‘/^ are decreasing for t > e (or in fact 

for t > 1.762); setting f = r/2 — 77, we see that the right side of (9.89 1 is a decreasing 
function of T for T > Tq, since To/2 — 77 > 25/2 > e. 

We can now apply Lemma 9.1.3[ and get that 

poo / 1 rp 1 \ 

E + (9.90) 

|a(p)l>To 

Since T > Tq > 75 > 2, we know that ((l/27r) log(T/27r) + 1/4T) < (l/27r)logT. 
Hence, the right side of (9.90|l is at most 


8.39 7~ 
47r 


(logT)" 


< 


(log x) (log T) 

0.668 J ^(logcc) ^logf+ + 


7r(T' — 2H) 

(T- 277)e-J —dT 


2H\ , ^ (logf)^ ^ 


(9.91) 


te « dt, 


where Ti = Tq — 277 and t = T — 277; we are using the facts that (logf)" < 0 for 
f > 0 and ((logf)^)" < 0 for t > e. (Of course, Ti > 25 > e.) 

Of course, = (4/7r)e“l’^/^l^L We recall (9.36i and (9.50i: 


/Ti 


log f • e ^ dt< [ log Ti + — 


4/7r\ e 4 


-fTi 


J (logf)fe < ^Ti + 


7r/4 

4a\ iQgXi 


7r/4 


for Ti > 1 satisfying logTi > 4/(7rTi), where a = 1 + (1 + 4/(7rTi))/(logri — 
4 /(ttTi)). It is easy to check that log Ti > 4/(7rTi) and 4a/7r < 1.6957 for Ti > 25; 
of course, we also have {A/tt)/25 < 0.051. Lastly, 

46\ e“5^i(logTi)^ 


J (\ogtfte-^*dt < (ti + 


7r/4 
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for Ti > e, where b = 1 + (2 + 8/(nTi))/{\ogTi — 8/(7rTi)), and we check that 
46/7r < 2.1319 for Ti > 25. We conclude that the integral on the second line of (9.91 1 
is at most 


4 

TT 




-(Ti + 2.132) + (loga;)(logri)(ri + 1.696) 




+ —• 2ff (log Ti + 0.051 + log a;)e . 

TT 

Multiplying this by 0.668 and simplifying further (using Ti > 25), we conclude that 
Ep:|9(p)|>To \Mv+,2{p)\ is at most 

((0.462logTi +0.909logo:)(logTi)Ti + 1.71(logTi + loga;)iJ) 


□ 


9.6 A verification of zeros and its consequences 


David Platt verified in his doctoral thesis iIpmTI . that, for every primitive character x 
of conductor q < 10^, all the non-trivial zeroes of L{s, x) with imaginary part < 10®/g 
lie on the critical line, i.e., have real part exactly 1/2. (We call this a GRH verification 
up to 10®/^.) 

In work undertaken in coordination with the present work IPlabl . Platt has extended 
these computations to 

• all odd q <3 ■ 10®, with Tg = 10 ®/( 7 , 

• all even g < 4 • 10®, with Tg = max(10®/g, 200 + 7.5 • 10^/g). 


The method used was rigorous; its implementation uses interval arithmetic. 


Let us see what this verification gives us when used as an input to Prop. |9.2.2| We 

< 4r/g. We set 
10® < g < 4 • 10®, though we 


are interested in bounds on | err^ (<5, x) \ for g < r and |i5| < 4r/g. We set r = 3-10®. 


(We will not be using the verification for g even with 3 
certainly could.) 

We let To = 10®/g. Thus, 


^ 10 ® 1000 

° - 3 • 10® 3 ’ 

To ^ 10®/g _ 1000 
7 r|5| ~ TT • 4r/g 127r 

and so, by |(5| < 4r/g < 1.2 • 10®/g < 1.2 • 10®, 

3.53e"°-^®®®^'> < 2.597- 10"^®, 


(9.92) 
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Since qTq < 10®, this gives us that 

log ^ f 

ZTT \ 1q J 

^ ^ 0~26 

< 4.3054 • 10"^2 + - -< 4.306 • 10"^^ 

9 

Again by Tq = 10®/g, 

2.myj% log qTq + 21M1^J% + 2.85 log q + 74.38 


is at most 


and 


648662 


111 , 


31ogg + 14|5| + 17<55 + 


(log? + 6 ) • (1 + 5|<5|) < 19 + 


1.7-10^ 

q 

1.2 • 10 ® 


Hence, assuming x > 10® to simplify, we see that Prop. 
err^_^((5, x) < 4.306 • 10“^^ + 


9.2.2 


55 + 1 . 700 ^ 


gives us that 

19+LMo! 


, 3/2 


< 4.306 • 10"22 + ^ 


1 /650400 


V V? 


112 


for 77 (f) = e * This proves Theorem 


7.1.1 


Let u s now see what Platt’s calculations give us when used as an input to Prop. [93)2 


and Cor. 9.3.3 Again, we set r = 3 • 10®, = 8 , |(5| < Ar/q and Tg = llf’/q, so 

\9.92) is still valid. We obtain 


To log 


qTp 

27r 


6.11e-°i®98To + i,578g-oi06®'/^ 


<log 


10 ® 


27r 

< 2.485 • 10"^®, 


6.11 • 1000^_o.i598.ic^ ^ ^^8 ^ ^ J.^g^_0.i065(^) = 

3 


since f exp(—0.1598f) is decreasing on f for f > 1/0.1598. We use the same bound 
when we have 0.0102 instead of 1.578 on the left side, as in ( |9.61| l. (The coefficient 
affects what is by far the smaller term, so we are wasting nothing.) Again by Tg = 
10®/(7 and q < r. 


279793 

1.22v3c)log(7rg + 5.053V71) + 1.423logg + 37.19 < -+ 55.2 

Vq 

1.679x/71) log qTo + 6.957+ 1.958logg + 51.17 < -— + 75.9. 
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For X > 10®, we use |(5| < Ir/q < 1.2 • 10®/g to bound 

(3 + ll|(5|)a:“^ + (logq + 6) • (1 + 6|(5|) • < ^0.0004 + 

{6 + 22\5\)x-^ + (log 9 + 6) • (3 + 17|(5|) • < (^0.0007 + 


1322^ 

X"®/®. 

7 ) 

2644' 

^ x"®/2 


Summing, we obtain 


err,,^ < 2.485 • IQ-^^ + X( + 56 


\ y/q 


for r](t) = t^e * and 


err,,< 2.485.10- + ^f^ + 76 
VX\ y/q 


for 77 (f) = f®e * *M V 2 {t). This proves Theorem 


7.1.2 


and Corollary 


7.1.3 


Now let us work with the smoothing weight 77 _|_. This time around, set r = 150000 
if q is odd, and r = 300000 if q is even. As before, we assume 

q<r, |5| < Ar/q. 

We can see that Platt’s verification llPlabI , mentioned before, allows us to take 

q 

since Tq is always at least this {Tq = 10®/(7 > 200 + 7 • 10^/g > 200 + 3.75 • 10^/g 
for q < 150000 odd, Tq > 200 + 7.5 • 10^/g for q < 300000 even). 

Thus, 

250r 250r 


H = 200, 


Tn-H = 


> 


= 250, 


To- H ^ 250r ^ 250 


, > ^ > — = 19.89436... 

TTO TTOq 47r 

and also 

To < 200 + 250 • 150000 < 3.751 • 10^ qTo <rH + 250r < 1.35 • 10®. 
Hence, since Vfe”® ®®®®* is decreasing on f for f > 1/(2 • 0.1598), 

11.308a/To - + 16.147|<5|e"°'^“® 

< 7.9854 • 10"®® + — • 7.9814 • 10"®® 

< 7.9854.10"®® 

q 
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Examining (9.70 1 , we get 

1.35-108 

< log- 


27r 


7.9854 • 10 


-16 


9.5777 • 10- 


/ Vl 35 • 10® \ 1 

+ (l.6341og(1.35 • 10®) + 12.43) ———-+ 1.321 log300000 + 34.51 ^ 

V V9 I VX 


+ 9 + 11 


1.2 • 10 ® 


+ (log 300000) 11 + 6 


1.2 • 10 ® 

q 


,.- 3/2 


< 1.3482 • 10"^^ + 


1.617- 10-1® 


/ 499845 


51.17- 


1.32 - 10® 


9 9.1 - 10^ 

+- 


qx 


\ Vq 

Making the assumption x > 10^^, we obtain 

1 . 617 - 10 - 1 ® 


139 

X 


errj7+,x(^, 3:^) < 1.3482 - 10 ^ + 


q 


/499900 

V 


52 


This proves Theorem |7. 1 .4| for general q. 

Let us optimize things a little more carefully for the trivial character xt- Again, 
we will make the assumption x > 10^^. We will also assume, as we did before, that 
|(5| < Ar/q-, this now gives us |^| < 600000, since q = I and r = 150000 for q odd. 
We will go up to a height Tq = H + OOOOOOtt - 1, where H = 200 and t > 10. Then 


To-H 6000007rf 


ttS 


Attt 


> t. 


Hence 


11.308Vro - + 16.147|<5|e ° 


< 10-1300000 + 96 8 9 0 00e-°i°®®‘". 

Looking at ( 9.70| l, we get 

err^, ,x^((5,x) < log p- - flO"!®®®®®® + 9689000e-° i°®®*" 

27r V 

+ ((1.6341ogTo + 12.43) v^ + 34.51)a:-i/2 + 6600009a;-i. 

The value f = 20 seems good enough; we choose it because it is not far from optimal 
for x ~ lO^i’. We get that Tq = 120000007r + 200; since Tq < 10®, we are within the 
range of the computations in BPlabl (or for that matter IIWed03l or BPlaalO . We obtain 


■) 


err^^,XT(^>a;) < 4.772 - 10-“ + 


251400 

^/x 


Lastly, let us look at the sum estimated in (9.72 1 . Here it will be enough to go up 
to just To = 2H + max(50, 77/4) = 450, where, as before, H = 200. Of course, the 
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verification of the zeros of the Riemann zeta function does go that far; as we already 
said, it goes until 10 ® (or rather m ore: s ee IIWed03l and llPi^ f. We make, again, the 
assumption x > 10^^. We look at (9.73 i and obtain that err^ 2 .r;+ is at most 


0.462 


(log 50)2 


+ 0.909log50^ • 50 + 1.71 (^1 + ’ 200^ 


log 10^2 

+ (2.445v^log450 + 50.04) • 


< 5.123 • lO"^'^ + 


366.91 


Va 


(9.93) 


It remains only to estimate the integral in (9.72 1 . First of all. 


D pOO 

7]^ (t) log xt dt = / 7]^ (t) log xt dt 

Jo 

pOO 

+ 2/ {r]+{t) - r]o{t))r]o{t)logxt dt ■ 


(p+(f) - rio{t)f logxt dt. 


The main term will be given by 


772 ( 1 ) loga;! dt = (0.64020599736635 + O (lO-^^)) log a: 

- 0.021094778698867 + O (lO'^®) , 

where the integrals were computed rigorously using VNODE-LP BNedOhll . (The in¬ 
tegral irjl{t)dt can also be computed symbolically.) By Cauchy-Schwarz and the 
triangle inequality. 



(r 7 +(l) - r]o{t))r]o{t)\ogxt dt < \rj+ - r]o\ 2 \Vo{t) \ 0 gxt \2 


< |r 7 +- ? 7 o| 2 (|??o| 2 loga:+| 77 o • log I 2 ) 
274.86, 

< -^^(0.800131oga: +0.214) 

< 1.944-10-®-loga: + 5.2-10-^ 


where we are using (A.23 1 and evaluate |? 7 o • log I 2 rigorously as above. By (A.23 1 and 


7*00 / 274 86 \ 2 

(7?+(l) - 7 ?o(t ))2 logxf dt< j loga:- 


27428 

~w~ 


< 5.903 • 10-12 . logx + 2.143 • 10 


-12 


We conclude that 

(t) log xt dt 

= (0.640206 + (9* (1.95 • 10"®)) log a: - 0.021095 + 0*(5.3 • lO"!’) 



(9.94) 
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We add to this the error term 5.123 • 10 
using the assumption x > 10^^. We obtain; 


366.91/v^ from (9.93 i, and simplify 


^ A(n)(logn)? 7 ^(n/a;) = 0.640206a; log a; — 0.021095a; 

+ 0* [2 ■ 10“®x log X + 366.9lVa; log x) 


n—1 


(9.95) 


and so Prop. 9.5.1 gives us Proposition |TT3 


As we can see, the relatively large error term 2 • 10 ° comes from the fact that we 


have wanted to give the main term in (9.72 1 as an explicit constant, rather than as an 
integral. This is satisfactory; Prop. |7.l)5 is an auxiliary result that will be needed for 
one specihc purpose in Part|^ as opposed to Thms. 7.1.1 -7.1.4 which, while crucial 
for Partll^ are also of general applicability and interest. 
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Chapter 10 

The integral over the major arcs 


Let 

Srj{a,x) =''^^A{n)e{an)r]{n/x), (10.1) 

n 

where a € K/Z, A is the von Mangoldt function and 77 : M —>■ C is of fast enough 
decay for the sum to converge. 

Our ultimate goal is to bound from below 


A{ni)A{n2)A{n3)r]i{ni/x)ri2in2/x)r]3{n3/x), ( 10 . 2 ) 

ni+n2+n3=N 

where 771 , 772,773 : K —?► C. Once we know that this is neither zero nor very close to 
zero, we will know that it is possible to write N as the sum of three primes ni, 712,773 
in at least one way; that is, we will have proven the ternary Goldbach conjecture. 


As can be readily seen, (10.2 1 equals 


Srj 3 {a, x)Sri 2 (a, x)Srf 3 {a, x)e{—Na) da. 


(10.3) 


In the circle method, the set K/Z gets partitioned into the set of major arcs and the 
set of minor arcs m; the contribution of each of the two sets to the integral (10.3 1 is 
evaluated separately. 

Our objective here is to treat the major arcs: we wish to estimate 



Sri 3 {a, x)Sri 2 {a, x)Srj 3 {a, x)e{—Na)da 


for dJl = dytso,r, where 


(10.4) 


U U (“ 

g<r a mod q '' 
q odd {a,q) — l 


Sqt a 5^ 
2 gx ’q 2qx 


-u u G 

g<2r a mod q ^ 
q even {a,q) — l 


Sgr a 
qx ’ q 


qx ) 
(10.5) 
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and (5o > Oj > 1 are given. 

In other words, our major arcs will be few (that is, a constant number) and narrow. 
While IILW021 used relatively narrow major arcs as well, their number, as in all pre¬ 
vious proofs of Vinogradov’s result, was not bounded by a constant. (In his proof of 
the hve-primes theorem, IITaol4l is able to take a single major arc around 0; this is not 
possible here.) 

What we are about to see is the general major-arc setup. This is naturally the place 
where the overlap with the existing literature is largest. Two important differences can 
nevertheless be singled out. 

• The most obvious one is the presence of smoothing. At this point, it improves 
and simplihes error terms, but it also means that we will later need estimates for 
exponential sums on major arcs, and not just at the middle of each major arc. (If 
there is smoothing, we cannot use summation by parts to reduce the problem of 
estimating sums to a problem of counting primes in arithmetic progressions, or 
weighted by characters.) 

• Since our T-function estimates for exponential sums will give bounds that are 
better than the trivial one by only a constant - even if it is a rather large con¬ 
stant - we need to be especially careful when estimating error terms, hnding 
cancellation when possible. 


10.1 Decomposition of Srj by characters 


What follows is largely classical; cf. 0HL22I or, say, 0Dav671 §26]. The only difference 
from the literature lies in the treatment of n non-coprime to q, and the way in which 


we show that our exponential sum ( 10.81 is equal to a linear combination of twisted 
sums over primitive characters x* ■ (Non-primitive characters would give us L- 
functions with some zeroes inconveniently placed on the line JRjs) = 0.) 

Write r(x, b) for the Gauss sum 


-{X,b)= X{a)e{ab/q) 


( 10 . 6 ) 


a mod q 


associated to a 6 S Z/gZ and a Dirichlet character x with modulus q. We let r(x) = 
t{x, !)■ If (b, q) = 1, then r(x, b) = x(6"^)r(x). 

Recall that x* denotes the primitive character inducing a given Dirichlet character 
X. Writing modg f™' ^ characters x of (Z/gZ)*), we see that, for any 

oo G IjlqL, 


1 


XI '^ix,b)x*{ao) 

X modg 


1 

Hq) 


X X x{a)e{ab/q)x*{ao) 

X mod q a mod q 
(a,9) = l 


= X 

a mod q 


e{ab/q) 


X modg 


{a,q) = l 


X 

a mod q 
(a,q) = l 


e{ab/q) 


X mod q' 


(10.7) 
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where q' = < 7 /gcd(g, ag°). Now, 
case mod 9 ' x(a"^ao) = </'(<?'))■ Thus, (l0.7| equals 

4>{q') 


, x(a ^ao) = 0 unless a = qq (in which 




e(a6/g) = 


a mod q 
(a,g) = l 
a=ao mod q' 


Hq') 

<t>{q) 


E < 

k mod q/q' 
{k,q/q') = l 


(oo + kq')b 


aob\ 

q ) 


E - 

k mod q/q' 
(k,qlq') = l 


kb \ ^ cl){q') 

q/q'J (t>{.q)^ 


apb 

q 


Tiq/q') 


provided that ( 6 , q) = 1. (We are evaluating a Ramanujan sum in the last step.) Hence, 
for a = a/q + S/x, q < x, {a, q) = 1 , 

tE E E X* {n)A{n)e{Sn/x)r]{n/x) 

X " 


equals 


Since {a,q) 
when (n, q) 


E 


—- ^A{n)e{an)r]in x). 


= 1, T(x,a) = x(a)T(x). The factor p((g,n°°))/(^(( 9 ,n°°)) equals 1 
= 1; the absolute value of the factor is at most 1 for every n. Clearly 


E =Ei°gpE^(E) • 

n p\q a.>l ^ ^ 

(n,9)5^1 


Recalling the definition (10.1 1 of Srj{a, x), we conclude that 


(ct j x'j 


1 

Hq) 


E x{a)T{x)Srj,x- 

X mod 9 



+ 0 * 


2EiogpE 

p\q q;>1 


V 


where 

Srj,xi/3,x) ='^A{n)x{n)e{l3n)T]{n/x). 

n 

Hence 5',,^ (a, x)Srj 2 (a, a:)S ',,3 (a, x)e{—Na) equals 



( 10 . 8 ) 

(10.9) 


1 

^(q)3 


EEE T{xi)T{X2)T{x3)Xi(.a)x2{a)xp{a)e{-Na/q) 

Xl X 2 X 3 


( 10 . 10 ) 


• Sm,xl d'S'92.x2 ^)Sv3,x 3 x)e{-6N/x) 


plus an error term of absolute value at most 


3 


^ E n E E 

p\q Q :>1 



X 


( 10 . 11 ) 
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We will later see that the integral of ( 10.1 l| l over is negligible - for our choices of 
r]j, it will, in fact, be of size C)(x(log a;)^), A a constant. The error term 0(a:(log x)^) 
should be compared to the main term, which will be of size about a constant times x^. 

In (lO.lOl, we have reduced our problems to estimating Srj^xi^/Xj x) for x prim¬ 
itive', a more obvious way of reaching the same goal would have made ( 10.111 worse 
by a factor of about y/q. 


10.2 The integral over the major arcs: the main term 


We are to estimate the integral (10.4 1 , where the major arcs DJlso.r are defined as in 
(10.5 1 . We will use ryi = p 2 = where 77 + and 77 * will be set later. 

We can write 

pOO 

/x,x) = Sri{5/x,x) = / rj{t/x)e{5t/x)dt-\-O*x)) ■ X 

Jo 


( 10 . 12 ) 


= pi — S) ■ X O* [eiTri^xT (<^5 x)) ■ X 
for x = XT the trivial character, and 

Sn,x{^/x) = 0*{eTVn^x^5,x)) -x 


(10.13) 


for X primitive and non-trivial. The estimation of the error terms err will come later; 
let us focus on (a) obtaining the contribution of the main term, (b) using estimates on 
the error terms efficiently. 

The main term: three principal characters. The main contribution will be given by 
the term in ( | 10 . 10 [ ) with Xi = X 2 = X 3 = Xo. where xo is the principal character mod 

q- 

The sum rjxo, n) is a Ramanujan sum', as is well-known (see, e.g., IIIK041 (3.2)]), 
T{xo,n) = E p.{q/d)d. (10.14) 

d\(q,n) 


This simplifies to p,{q/{q, n))(j){{q, n)) for q square-free. The special case tt, = 1 gives 
us that t(xo) = p{q)- _ 

Thus, the term in ([10.10 1 with Xi = X 2 = Xs = Xo equals 


e{-Na/q) 3 


4>{q) 


3 Tiq) S^+,xoi^/x,x) Sn,,xii^/x.x)e{-SN/x), 


(10.15) 


where, of course, (a, x) = Srj{a, x) (since Xo is the trivial character). Summing 
(10.15 I for a = ajq-i-Sjx and a going over all residues mod q coprime to q, we obtain 


^{cCN)) 


(j){qf 


i^/x, x fSr,,,x- {5/x, x)e{-5N/x). 
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The integral of (10.15 1 over all of 9ft = Oft^g^r (see (10.5 1 ) thus equals 


^ ^ Sl^^^.{a,x)Sr,„^i{a,x)e{-aN)da 

q odd 


+ E 

q<2r 
q even 




T{qfT{{TN)) 


The main term in (|10.16|) is 



Sl^^^.{a,x)Sq,^^>[a,x)e{-aN)da. 


(10.16) 


dpr 

■ E J iri^i-ax)fi%i-ax)e{-aN)da 

q odd 

Sqt 

E J \^^{^{-ax)f'q^{-ax)e{-aN)da. 

q^2r qx 

q even 

(10.17) 

We would like to complete both the sum and the integral. Before, we should say 
that we will want to be able to use smoothing functions 77 + whose Fourier transforms 
are not easy to deal with directly. All we want to require is that there be a smoothing 
function rjo, easier to deal with, such that rjo be close to 77 + in £2 norm. 

Assume, then, that 


\V+ - Vo\2 < eo\Vo\, 


(3) 

where 770 is thrice differentiable outside finitely many points and satisfies 770 € Ti. 

Then (10.17 1 equals 


Opr 

E , {Th{-ax)fr^^{-ax)e{-aN)da 


q^r 
q odd 


<^( 9)3 


OQ' 

■ E didfdiiT N)) [ ^ {rio{-ax))‘^fi,{-ax)e{-aN)da. 


q<2r 
q even 


0(q)3 


plus 


O* . 




(10.18) 


(10.19) 
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Here (10.19 1 is bounded by 2.82643a:^ (by (C.9 1 ) times 


/ poo poo 

l^(-a) - ^o(-a)|2da • / |^(-a) + 7fo(-a)|2da 

y J —oo J —oo 

< |r 7 *|i • 1^ - TfoblilT + ?fo |2 = |?y*|i ■ \ri+ - ?7o|2|??+ + ??o |2 

< l^*li • \v+ - ?7o|2(2|77o|2 + \v+ - ’lob) = \v*\i\ilo\l ■ (2 + eo)eo- 


Now, (|10.18|l equals 


/ OO 

{■qo{-ax)Y'q^[-ax)e{-aN) ^ 

-OO „ / 


= x^ {f]'oi—ax))^i%,{—ax)e{—aN)da- 

y,>i 

/ oo _ 

{jfo{—ax))^rf^,{—ax)e{—aN) ^ 

-OO 


m.N)) 

(biq)^ 

T ^ • ( ^0’" ^ t\H/ 

fl(qP = l 


fj,{{q,N))da 




(5,2) 

^l(qY = l 




m.N)) 

4>{qY 


The last line in (10.20 1 is boundecj^by 


p{{q,N))da. 


( 10 . 20 ) 


/ OO 

i’ro(-a)b 

-oo „ / 




KqY 

biqf 


da. 


( 10 . 21 ) 


By (2.1 1 (with k = 3), (C.16l and (C.17 1 , this is at most 


x^|?7*|i 


.,24.31004, 

/ |?7o(-a)b- da 

l-5o/2 r 


■ 2 x^ 177 * 


ho^^li V 8 . 62008 |q;| 


da 


' Sq/2 


(271 aY J Sgr 

< \r]Yi ( 4 . 3 IOO 4 I 770 I 2 +0.00113^^^^1 y. 


It is easy to see that 

biiq,N)) 


q>l p\N ^ ' p\N 


1 + 


(p - 1)^ 


^This is obviously crude, in that we are bounding N))/4>{q) by 1. We are doing so in order to 
avoid a potentially harmful dependence on N. 
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Expanding the integral implicit in the dehnition of /, 


pOO 

/ {fjo{—Oix)Y'q^{—ax)e{—aN)da = 

J 00 

J J Vo{ti)ilo{t2)r]* - {ti + ^ 2 )^ dtidt2. 


1 

X 


( 10 . 22 ) 


(This is standard. One rigorous way to obtain ( 10.22| i is to approximate the integral 
over a G (— 00 , 00 ) by an integral with a smooth weight, at different scales; as the scale 
becomes broader, the Fourier transform of the weight approximates (as a distribution) 
the S function. Apply Plancherel.) 

Hence, ( 10.17|l equals 


' j J Vo{ti)rio{t2)r]t - {h + £ 2 )^ dtidt2 


n(i 

p\N 


{p-iy 


■n(i 


(p - 1)' 


(10.23) 


(the main term) plus 


2.82643|77 o |2(2 + £0) ■ £0 + 


4.31004|?7o|i + 0.00113 




\v* 


IX 


(10.24) 


Here ( |10.23| l is just as in the classical case IIIK04I (19.10)], except for the fact that 
a factor of 1 /2 has been replaced by a double integral. Later, in chapter[^ we will see 
how to choose our smoothing functions (and x, in terms of N) so as to make the double 
integral as large as possible in comparison with the error terms. This is an important 


optimization. (We already had a hrst discussion of this in the introduction; see (1.39 1 
and what follows.) 

What remains to estimate is the contribution of all the terms of the form err^_^ (5, a;) 
in ( |10.12| i and ( 10.13| l. Let us hrst deal with another matter - bounding the £2 norm of 
15 ^( 0 ;, x)y over the major arcs. 


10.3 The £2 norm over the major arcs 


We can always bound the integral of |S'^(a, x)\‘^ on the whole circle by Plancherel. If 


we only want the integral on certain arcs, we use the bound in Prop. 12.1.2 (based on 


work by Ramare). If these arcs are really the major arcs - that is, the arcs on which 
we have useful analytic estimates - then we can hope to get better bounds using L- 
functions. This will be useful both to estimate the error terms in this section and to 
make the use of Ramare’s bounds more efficient later. 
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By ( 10 . 81 , 


E 


a mod q 
gcd(o,g) = l 


-S',) ( - + -,x 

' q X 






X x' 


\ 


E X(a)x'(a) 


a mod q 
\gcd(a,g) = l 






+ 0* (^2(1 + y^)(\ogx)‘^\r]\^max\Sn{a,x)\ + ((1 + v^)(loga;)^|??|oo)' 

^El^(x)n^.x*(^A,a;)|2+K,,i(2|5,(0,x)|+X,,i), 




where 

Kq^l = (1 + y/q){l0gxf\7j\ao- 

As is well-known (see, e.g., IIIK041 Lem. 3.1]) 


t{x)=t[ X* ) r{x*), 


where q* is the modulus of x* (i-S-j the conductor of x), and 

|t(x*)| = v^. 


Using the expressions (10.12i and (10.13 i, we obtain 


E 

a mod q 
{a,q) = l 



( a 

s \ 

Sq 


-,x 


X J 


\t]{-6)x + 0* {eiin.xTi^^x) ■ x)f 


( E 9 *-O* (|err^,x(5,x)px2)j -f A:,,i( 2|S'^(0, x)| 

i\v{-S)\^ + 0* (|err^,^^(5,x)(2|??|i -f err^,^.^((5,x))|)) 


+ Kqx) 


yx/XT 

4>{q) 


+ O* [ max q* I err^_;^. (i5, x) px^ + Kq^ 2 x] 

\x¥=xt ’ ’ / 


where AT ,,2 = Arg,i(2|5'^(0,x)|/x + Kq^jx). 
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Thus, the integral of over Wl (see (10.5 1 ) is 


E E 


, a , ^ 
q^2qx 


q^r a modg g 2qx 
q odd {a,q) — l 


\s,{a,x)\^da+Y: E 

<j<2r a mod g g gx 
q even {a,q) — l 


\Sjj{a^x)\ da 


= E 


q<r 
q odd 


Hq) y_|oi 


{—ax)\^ da + 




X / 


q< 2 r 
q even 


(/'(g) 7_^oi 


{—ax)\'^ da 


gcd(g, 2),5or ^ 1 , ^ 

" -S-(BT te(2Hi+i5T fp) 




+ y ^.o- 
St 1 

q odd 


\ 


max g*| err^^^»(5, x)|^ H- — 

X mod q ’ X 

X/XT 
\|( 5 |<< 5 or/ 2 (j 

/ 


+ y 

q even 


K„ 


max q*\eTT^^^»{S,x)f + 

X modg ’ X 

x¥=xt 
\|5|<5o»'/9 


where 


(10.25) 


ETrj^s = max | err,,_j^.j, {6, x) \ 

|5|<s 


and XT is the trivial character. If all we want is an upper bound, we can simply remark 
that 



q odd q even 


< 


E 

q<r 
^q odd 


'/'(g) 


E 

q< 2 r 
q even 


'/’(g) 


\v\ 


2MiE 

q<r 
q odd 


(/(g)' 


If we also need a lower bound, we proceed as follows. 

Again, we will work with an approximation rjo such that (a) Ip — P 0 I 2 is small, (b) 
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(3) 

Tjo is thrice differentiable outside finitely many points, (c) rjo G Li. Clearly, 


q<r 
q odd 


— |,(-aMI da 




q<r 
q odd 


= E 


q<r 
q odd 


tHq) f f- 

m \J- 

T^jq) /-^ 
(jdiq) 


<? 2 

|7fo(-a)| da + 2(|7fo| Jiy-jfol) + Ip-^o 

QT 

2q 

\r]o{-a)\^ da 


+ O* Q logr + 0.85^ (2 |?7o|2 |t? - ??o |2 + l??o - q\t) , 
where we are using (|C.ll 1 and isometry. Also, 


E 




(—ax) 


'(ia= E 


q<2r 
q even 


q<r 
q odd 


pHq) 

(t>{q) 


By(l^ and Plancherel, 

Snr- / 

/ 2g 

^ \T]'o{—a)\'^ da = / \r]'o{—a)f da — O* {2 I 

2q \ 2c 

/ |„(3)|2„5 

= \vo\l + o*i^ 

Hence 

dHq) 


(—ax)\ da. 


\Vo li 
(27ra)® 


da 


liq-^ 


y 57r3(i5or’)' 


/ 




q^r 
q odd 


q^r 
q odd 


dHq) Ivi^'^llq^ 


\ 


\ 


E ^(g) 57r6(,5or)5 

^q odd 


Using (C. 18 I, we get that 


E 

q<r 
q odd 


p^jq) ^ V 

(j){q) 57r®((5or)3 ~ r ^ 4){q) Bir^dg 

q odd 

„(3)|2 


< 


|t?' 


to ll 
Btt^Sq 


0.64787 


logr 0.425 

-1- 

4r r 


Going back to ( |10.25| l, we use ( |C.7[ ) to bound 
M^(q)x^ gcd(q,2)(5or 


^(q) 


qx 


< 2.59147-Jorx. 
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We also note that 


E- + E - 

^ q ^ q 

q<r Q'^2r‘ 

q odd q even 





1 

q 


< 21oger 


log - < log 2e^r. 


We have proven the following result. 

Let rj : [0, 00 ) —>■ IR Z7e in Li H 


Lemma 10.3.1. 

let SJt = ^So,r be as in {10.5\. Let ijo : [0,(X)) 

G Li. 


finitely many points. Assume rjo 
Assume r > 182. Then 


(3) 


Let Sjj{a, x) be as in {10.1 i and 
R be thrice dijferentiable outside 


j^\Sr,{a,x)\^da = Lr,SoX + 0* (^5.19(5oxr ^^ 

+ O* ((5or(log2e^r) {x ■ + Kr, 2 )) , 


(10.26) 


where 


Er,,r,So = max yTf\ err^^^. (5, x) \, ET^j^s = max | err^_^.j, (<), x) |, 

X modq l<5|<s 

q<r.gcd(q,2) 

\S\<gcd(q,2)5or/2q 


Kr,2 = (1 + v^)(loga;)^|?7|oo(2|5^(0,a;)|/a; + (1 + \/^)(logx)^|77|oo/a;) 

and Lr^So satisfies both 


Lr,So ^ 2|p|2 


q<r 
q odd 


T\q) 

(t>{q) 


(10.27) 

(10.28) 


and 


Lr,So =2|?7o|2E + ■ ( 2 |^°l 2 l^-^o |2 + l7o-??l2) 

q odd 


+ 0 * 


(3)|2 
57r®(5g 


2|r/, 


fo.64787+ 

\ 4r 


0.425^ 


(10.29) 

Here, as elsewhere, x* denotes the primitive character inducing x, whereas q* denotes 
the modulus of x*- 


The error term xrETjj Sor will be very small, since it will be estimated using the 
Riemann zeta function; the error term involving 2 will be completely negligible. 
The term involving a;r{r + l)i?^ ^ we see that it constrains us to have | err^ ^^,( 0 ;, iV)| 
less than a constant times 1 /r if we do not want the main term in the bound ( |10.26 1 to 
be overwhelmed. 
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10.4 The integral over the major arcs: conclusion 


There are at least two ways we can evaluate ( |10.4| i. One is to substitute ( |10.10[ ) into 
( 10.4| i. The disadvantages here are that (a) this can give rise to pages-long formulae, (b) 
this gives error terms proportional to xr\ err^ ^(x, A^)|, meaning that, to win, we would 
have to show that | err^_j|-(x, N) \ is much smaller than l/r. What we will do instead is 
to use our £2 estimate ( |10.26[ ) in order to bound the contribution of non-principal terms. 
This will give us a gain of almost ^/r on the error terms; in other words, to win, it will 
be enough to show later that | err^_^ {x, N) \ is much smaller than 1 / y/r. 


The contribution of the error terms in Srj^{a,x) (that is, all terms involving the 
quantities err^ ^ in expressions (10.12 1 and ( 10.13|) to (10.4 1 is 


H ^ H ^(^3) H X3{a)e{-Na/q) 

q^r X3 niod g a mod g 

g odd (a, 9 ) = ! 



{a + a/q^ x)^ err^^ ^^* (ax, x)e{—Na)da 


+ ^(^3) ^ X 3 {a)e{-Na/q) 

9^27* X3 mod g a mod g 

geven {a,q) — l 


(10.30) 



Sri+ (a + a/g, x)^ err,^^ (ax, x)e{—Na)da. 


We should also remember the terms in (10.11 1 ; we can integrate them over all of M/Z, 
and obtain that they contribute at most 


n -maxVlogp V77J ( — ) da 

^ - p\q a>l ^ ^ 

^ ^ tt f 

^25] n \Sr^Pa,x)\ 2 -ina^Y.^ogpY,Vj (— 

3-3 f ^3 p\q “>1 ^ 


= 2 ^ A^(n)? 7 ^(n/x) • logr • max ^ 77 * ( — 


t>i 


+ 4 A^{n)ri\{n/x) ■ ^ A'^{n)r]'^{n/x) ■ logr • max ^ 77 * | — 

y n n a>l 


by Cauchy-Schwarz and Plancherel. 
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The absolute value of (|10.30|l is at most 


q^r a mod q 2 qx 
q odd {a,q) — l 

.Nz 

^ ^ /L 

g<2r a mod q qx 

q even {a,q) — l 


Sp-r 

I |2 _ 

\Srij^{a + a/q,x)\ da- max Vq*\errjj^^^*{S,x)\ 

^ " X mod q ’ 

\S\<Sor/2q 


\STj.{a + a/q,x)\ da- max ^/^\erTrj,,x*{S,x)\ 

X mod q 
\S\<Sor/q 


< 




(a)| da- max y/^\eriri.,,x*{S,x)\. 


X modg 
q<r-gcd(ij,2) 
|i5|<gcd(g,2)<5or-/g 


(10.31) 

We can bound the integral of \Sri^ (o^) P by (10.26 1 . 

What about the contribution of the error part of Srf^{a^x)l We can obviously 
proceed in the same way, except that, to avoid double-counting, Sjj^{a,x) needs to 
be replaced by 

1 - fJ-iq) 


^^(Xo)?r3(-<5) • a; = • a;, 


(10.32) 


which is its main term (coming from (|10.12|)). Instead of having an norm as in 
( 10.31| l, we have the square-root of a product of two squares of £2 norms (by Cauchy- 
Schwarz), namely, \S*^{a)\'^da and 


E 

q^r 
q odd 




M (g) 

(j){qy y_4oi 


aa;)a;|^ da-1- ^ 


g<2r 
q even 


T^q) /■- 

y_£oi 


|f^(—aa:)a:| da 


(10.33) 


< xlvlll - 


T\q) 

(t>{qy' 


By ( |C.9| l, the sum over q is at most 2.82643. 

As for the contribution of the error part of 5^^ (a, a:), we bound it in the same way, 
using solely the £2 norm in (10.33 1 (and replacing both and Srj^{a,x) by 

expressions as in (10.32 1 ). 

The total of the error terms is thus 

X - 


max • I err^^^^. ((5, a;)| • A 

X mod q ’ 

q'<r-gcd(g',2) 

\6\<gcd{q,2)Sor/q 

+ X * max Vq*- |err^ 

X mod q 
q<r.gcd{q,2) 

\5\<gcd(q,2)Sor/q 


(10.34) 


where A = {1/x) (a, a;) pda (bounded as in (10.26 1 ) and 


= 2.82643|p*p, B+ = 2.82643|7;+p. 


(10.35) 
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In conclusion, we have proven 

Proposition 10.4.1. Let a; > 1. Let ??+, p* : [0, oo) —)■ K. Assume G C^, ry" € L 2 
and P+, 77 * S n L^. l^t rjo '■ [ 0 , cx)) —> K Ijc thrice differentiable outside finitely 

/Q\ 

many points. Assume G Li and |p+ — P 0 I 2 < eo|Po| 2 . where eg > 0. 

Lef Srj[a,x) = A(n)e(an)p(n/x). Lef x primitive, be given as in 

1 10.12 1 and \ 10.13\ . Let i5o > 0, r > 1. Let DJI = be as in {10.5). 

Then, for any N > 0, 


/ Sri^{a,x)^ Sri,(a,x)e{—Na)da 

Jw 


equals 


, 4.31004|po|i+ 0.0012lVi , 

+ 2.82643|po|2(2 + co)' + -- ° j 


+ 0 * {E^^ r,So^ri+ + £’?)+ ,r-,(5o ’ 1-6812(^^4^ + 1.6812|p+1 2 ) |p* 12) ’ 


+ 0 * ( 2 Z^ 2 ^ 2 ix)LSn, [x, r) ■ x + Z^ 2 ^^ 2 (,x)Z^ 2 ^ 2 {x)LSr,^_ {x, r) ■ xj 

(10.36) 


where 


co=nfi-7r^Vnfi 


p\N 


+ 




Cr,„,ri, = J J Po(tl)po(t2)p* “ (^1 + ^ 2 )^ dtidt2, 


(10.37) 


En,r,5o = a/^-| err^.x*(^>^)l> ETr,^s = max | err^,^,j,(^>a;)|, 

X mod q l^l<s/9 

q<gcd(i 3 , 2 )-T- 
|5|<gcd(ij,2)(5or-/2(/ 


A.J} f I(o(, x)I dot^ ^ ^ ^ 


q<r 
q odd 


Hq) 


Kr,2 = (1 + V^)(loga:)^|p|oo(2^^,i(a;)/a; + (1 + v^)(logx)^|p|oo/a;), 


Zn,k{x) = - A''(n)p(n/x), LS^{x,r) = logr • max p ( — ) > 

X p<r \ T- / 

n cii>l 


(10.38) 


and is as in {10.12) and {10.13). 


Here is how to read these expressions. The error term in the first line of (10.36 1 


will be small provided that eo is small and r is large. The third line of ( |10.36 1 will 
be negligible, as will be the term 2(5o7'(log er)76^,2 in the definition of (Clearly, 
Zn,k{x) (loga;)^“^ and LSrf{x,q) T{q) log a; for any p of rapid decay.) 
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It remains to estimate the second line of (10.36 1 . This includes estimating Arj - 

We see that we will have to 


a task that was already accomplished in Lemma 10.3.1 


give very good bounds for Ejj^r,So when 77 = or 77 = p*. We also see that we want 
to make CoCri^^r],x^ as large as possible; it will be competing not just with the error 
terms here, but, more importantly, with the bounds from the minor arcs, which will be 
proportional to 177 + 1 | 177 * 11 . 
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Chapter 11 

Optimizing and adapting 
smoothing functions 


One of our goals is to maximize the quantity in ( 10 . 37 [ ) relative to |??o|2|t?*|i- 

One way to do this is to ensure that (a) 77* is concentrated on a very shorj^interval [ 0 , e), 
(b) rjo is supported on the interval [0, 2], and is symmetric around t = 1, meaning that 
r]o{t) ^ 770(2 — f). Then, for x N/2, the integral 



Voiti)Vo{t2)V* 



(ti + ^ 2 ) 


dtidt2 


in (| 10 . 37 [) should be approximately equal to 


\v*\i 



dt = |? 7 *|i • 



iloitYdi = |77*|i • |?7o|2, 


( 11 . 1 ) 


provided that 770 (<) > 0 for all t. It is easy to check (using Cauchy-Schwarz in the 
second step) that this is essentially optimal. (We will redo this rigorously in a little 
while.) 

At the same time, the fact is that major-arc estimates are best for smoothing func¬ 
tions 77 of a particular form, and we have minor-arc estimates from Part|I]for a different 
specific smoothing 772. The issue, then, is how do we choose 770 and 77* as above so that 


• 77* is concentrated on [0, e). 


• rjo is supported on [0, 2] and symmetric around t = 1, 


• we can give minor-arc and major-arc estimates for 77*, 


• we can give major-arc estimates for a function close to rjo in £2 norm? 


^This is an idea appearing in work by Bourgain in a related context IBou99l . 
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11.1 The symmetric smoothing function r]o 


We will later work with a smoothing function whose Mellin transform decreases 
very rapidly. Because of this rapid decay, we will be able to give strong results based 
on an explicit formula for pcj. The issue is how to define 770, given rycj, so that ryo is 
symmetric around t = 1 (i.e., 770(2 — x) ~ 770 (x)) and is very small for x > 2 . 

We will later set q^it) = Let 


h : 1 1-^ 


f3(2-f)3e‘-i/2 

0 


ifi e [ 0 , 2 ], 

otherwise 


We define 770 : M —?► M by 


Vo{t) 


h{t)q^{t) = 


f3(2-f)3e-(*-i)'/2 

0 


iff G [ 0 , 2 ], 
otherwise. 


It is clear that 770 is symmetric around f = 1 for f G [ 0 , 2 ]. 


( 11 . 2 ) 


( 11 . 3 ) 


11.1.1 The product ? 7 o((:)? 7 o(p — t). 

We now should go back and redo rigorously what we discussed informally around 
( |ll.l| i. More precisely, we wish to estimate 

/ OO pOO 

qo{t)qo{p - t)dt = / qo{t)qo{2-p + t)dt ( 11 - 4 ) 

-OO j —OO 

for p < 2 close to 2 . In this, it will be useful that the Cauchy-Schwarz inequality 
degrades slowly, in the following sense. 

Lemma 11.1.1. Let V be a real vector space with an inner product (•, •). Then, for 
any v,w G V with [w — v \2 < ^[ 2 / 2 , 

{v,w) = 1771217x12 + 0*(2.71\v - wll). 


Proof. By a truncated Taylor expansion. 


n - . a; x 

Vl + x = l + — + — max 


2 2 o<t<i 4(1 — (te) 2 ) 3/2 


= i + - + o- 


I 23/2 ) 


for |x| < 1 / 2 . Hence, for 5 = [tc — 7;|2/|77|2, 


H2 ^ ^ 2(77; - 7 ;,x) + [tx - v\l ^ ^ 


2 + <52 


LI 


M2 V |X |2 


<52 = 1 


(tx — X, x) 


o* 


+ O* 2.71 


{25 + 5^f 

23/2 


Ix |2 
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Multiplying by juH, we obtain that 

= \v\'i + {w - v,v) + O* (2.71|w - v\l) = {v,w) + O* (2.71|w - v\l) . 


□ 


Applying Lemma fl 1.1.1 1 to ( |1 1.4[ ), we obtain that 

/ OO 

T]o{t)r]o{{2 - p) -i- t)dt 

-OO 


\Vo{tWdt, 


ho((2 - p)+t)\'^dt 


TO* [ 2.11 J \po{t) - Po{{2 - p)+t)\^dt^ 

= \po\l + 0* (^ 2.71 'Woir + t)\ 


dr dt 


(11.5) 


= l» 7 o |2 + o* (^2.71(2 - p) / f \ri'^{r + t)f dtdr 
\ Jo J -00 

= \Po\1 + 0*{2.71{2-p)W2)- 


We will be working with 77 * supported on the non-negative reals; we recall that rjo 
is supported on [0, 2]. Hence 



J 'no{ti)r]o{t2)'il* - {ti + dtidt2 


ivo *Vo){p)il* { — - p]dp 


(IP 0 I 2 + 0*(2.71(2 - p)^|77ol2)) ■V*{^-P]dp 


= \Vo 


77*(p)dp + 2.71|p;|^0* 


{(2- N/x) + p)‘^p4p)dp 


( 11 . 6 ) 


provided that N/x > 2. We see that it will be wise to set N/x very slightly larger than 
2. As we said before, p* will be scaled so that it is concentrated on a small interval 
[0,e). 

11.2 The smoothing function r/*: adapting minor-arc 
bounds 


Here the challenge is to dehne a smoothing function 77 * that is good both for minor-arc 
estimates and for major-arc estimates. The two regimes tend to favor different kinds of 
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smoothing function. For minor-arc estimates, we use, as flTaol4ll did, 

r] 2 {t) = 4max(log2 - |log2f|,0) = ((2/[i/2,i]) *m (2/[i/2,i]))(f), (11.7) 

where I[i/ 2 ,i]{t) is 1 if f € [1/2,1] and 0 otherwise. For major-arc estimates, we will 
use a function based on 

/o 

rjc:? = e ^ . 

We will actually use here the function t^e~* whose Mellin transform is Mr]'y{s + 2) 
(by, e.g., lIBBOlOl Table 11.1]).) 

We will follow the simple expedient of convolving the two smoothing functions, 
one good for minor arcs, the other one for major arcs. In general, let (pi,if 2 ■ [0, oo) — 
C. It is easy to use bounds on sums of the form 

Sf,vi{x) ='^f{n)(pi{n/x) ( 11 . 8 ) 

n 


to bound sums of the form 


Sf, 




^/(n)((/3i P 2 ) 


( — ) T2(w) 
\wx/ 


, dw 
w 



Sf^^^{wx)(f2{w) 


dw 

w 


(11.9) 


The same holds, of course, if (pi and ip2 are switched, since pi *m T2 = T2 *m Ti- 
The only objection is that the bounds on ( |11.8| l that we input might not be valid, or 
non-trivial, when the argument wx of Sf^ip^{wx) is very small. Because of this, it is 
important that the functions (pi, p 2 vanish at 0 , and desirable that their first derivatives 
do so as well. 

Let us see how this works out in practice for pi = 772 . Here 772 : [0, 00 ) —>■ K is 
given by 


m =Vi *mVi =4max(log2- |log2f|,0), (11.10) 


where 771 = 2 • /[i/ 2 ,i]. _ 

Let us restate the bounds from Theorem lS.l.ll - the main result of Part||l We will 
use Lemma C.2.2 to bound terms of the form q/(j>{q). 

Let X > xo, xo = 2.16 • 10^°. Let 2q; = a/q + 5/x, q < Q, gcd (a,(7) = 1, 
\5/x\ < IjqQ, where Q — (3/4)x^/^. Then, if 3 < <7 < Theorem 3.1.1 gives 

us that 


\Sr, 2 ia,x)\ < gx max 


’ 8 


■q x 


( 11 . 11 ) 


where 


9x{r) 


{Rx, 2 r log2r -I- 0.5)a/ F{r) + 2.5 ^ L 2 r g 36a;“^/® 

r 


( 11 . 12 ) 
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with 


R^^ = 0.27125 log 1 + 


13 


log At 


2 log 9x1/3 
^ 2.004t , 


0.41415 


Lt = F{t/2) ( — log t + 7.82 ) + 13.66 log t + 37.55, 


If g > a;^/^/6, then, again by Theorem 


3.1.1 


\Sri 2 io:,x)\ < h{x)x, 


(11.13) 


(11.14) 


where 

h{x) = 0.276x“^^®(logx)^/^ + 1234x“^/^ logx. (11.15) 

We will work with x varying within a range, and so we must pay some attention 
to the dependence of ( |ll.ll| i and ( |11.14| i on x. Let us prove two auxiliary lemmas on 
this. 


Lemma 11.2.1. Let gx{r) be as in 

X I—■ 


\11.12 ) and h(x) as in 


h{x) ifx < (6r)^ 
gx{r) ifx > {6rf 


( 77.75| ). Then 


is a decreasing function of x for r > 11 fixed and x > 21. 

Proof. It is clear from the definitions that x i— h{x) (for x > 21) and x i— gx{x) are 
both decreasing. Thus, we simply have to show that h{xr) > gxr-{f) for Xr = (6r)^. 
Since > (6 • 11)^ > 


Rx^, 2 r < 0.27125log(0.065log+ 1.056) + 0.41415 

< 0.271251og((0.065 + 0.0845) log +0.41415 < 0.27215 log log x^. 

Hence 

Rx^, 2 r log2r + 0.5 < 0.27215 log log x^ log x^^ - 0.27215 log 12.5 log 3 + 0.5 
< 0.09072 log log Xr logXr — 0.255. 


At the same time, 

^ 1/3 

r (r) = log log 

6 

< e''" log log Xr 


2.50637 
log log r 


< e'*' log log Xr 


eT" log 3 + 1.9521 


(11.16) 


for r > 37, and we also get F (x) < e'*' log log Xr for r G [11, 37] by the bisection 
method with 10 iterations. Hence 

{Rx^,2r log 2r + 0.5) + 2.5 

< (0.09072 log log Xr log Xr — 0.255) a/ e'^ log log Xr + 2.5 

< 0.12111ogXr(loglogXr)^^^ + 2, 
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and so 


(Rx 2r T 0.5) \/ /^(r) “1-2.5 , . \‘^/9 \ i/k 

^ ^^ ’ - < (0.21 logx^(loglogx^)3/2-h3.47)x-i/®. 

V 2r 

Now, by ( 11.16| l, 

L2r < e'^ \og\ogXr ■ log(xy^/3)-l-7.82^ -I-13.661og{xy^/3)-I-37.55 


< e'*' log log Xr 


13 

12 " 


Xr + 4.25 ) -1-4.56 log Xr + 22.55. 


It is clear that 


4.25e'>' log log Xr + 4.56 log Xr + 22.55 

x ^^/6 


< 1234a;j. loga:^. 


for Xr > e: we make the comparison for Xr = e and take the derivative of the ratio of 
the left side by the right side. 

It remains to show that 

0.21 log a;^.(log log + 3.47 + 3.36 + loga;,. log log a;^. (11.17) 

is less than 0.276(loga;r)^/^ for Xr large enough. Since t i—)■ (logf)^/^/f^/^ is de¬ 
creasing for t > e^, we see that 

0.21 loga;r(loglogXr.)^'^^ + 6.83 + loga;^ logloga;^ ^ ^ 

0.276(logx^)3/2 ^ ^ 


for all Xr > 6^3, simply because it is true for X = e33, which is greater than . 

We conclude that h(xr) > gxrif) = /&) for Xr > e33. We check that 

h(xr) > Qxri^V^IQ) for loga;^ G [log663,33] .^gjj |^y bisection method 
(applied with 30 iterations, with loga:^ as the variable, on the intervals [log 663, 20], 
[20, 25], [25, 30] and [30,33]). Since r > 11 implies Xr > 663, ^g (jone. □ 


Lemma 11.2.2. Let Rx^r 
3 log 6r. 


be as in {11.12 1. 


Then t —>■ R^t rir) is convex-up for t > 


Proof. Since t ^ e and f —f are clearly convex-up, all we have to do is to show 
that t Ret,r is convex-up. In general, since 


(log/)" = 



n - w? 

p 


a function of the form (log f) is convex-up exactly when f" f — (f'Y > 0. If fit] = 
1 + a/{t - b), we have f"f - (f > 0 whenever 

(t -t- a — b) ■ (2a) > a^, 


i.e., + 2at > 2ab, and that certainly happens when t > b. In our case, b = 

31og(2.004r/9), and sot> 31og6r implies t >b. □ 
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Now we come to the point where we prove bounds on exponential sums of the form 
Srj, {a, x) (that is, sums based on the smoothing 77 *) based on our bounds ( 11.111 and 
(11.14 1 on the exponential sums (a, x). This is straightforward, as promised. 

Proposition 11.2.3. Let x > Kxq, xq = 2.16 • 10^°, K > 1. Let S^{a,x) be as 
in {10.1 1. Let 77 * = 772 *m where 772 is as in { 11.10\ and ip : [0, 00 ) —>■ [0, cx)) is 
continuous and in L^. 

Let 2a = afq-\-Slx, q < Q, gcd{a,q) = 1, \5/x\ < 1/qQ, where Q = (3/4)a;^/^. 
If<l < {x/Ky/^/O, then 


^max ^1, q 


\ix, 


(11.18) 


where 


/ N _ {Rx.K,ip,2r log 2r + 0.5)^yr{r) + 2.5 ^ L2r ^ 


Rx.K.ip,t Rx,t “f {Rx/K.t Rx.t) 


Cip.2 ,K/\t\i 

XogK 


with t and Lf are as in {11.13 ( and 


(11.19) 


a 




h/K 


ip(w) log 77 ; dw. 


Ifq > [x/Id), then 

< h^{x/K) ■ \ip\ix, 

where 

h^{x) = h{x) + 

G/K 

Cp,o,K = 1.04488 / 1 79 ( 777 ) 1 ^ 77 ; 

Jo 


and h{x) is as in {11.15). 
Proof. By ( |1 1.9 1 , 

to 


( 11 . 20 ) 


( 11 . 21 ) 


S„^{a,x) = / Sr.Ja,wx)ip{w) -h / S„Aa,wx)ip{w) 

Jo W Ji/ii 


dw 

w 


We bound the hrst integral by the trivial estimate |5'^2(Q;,7i;a:)| < |5'^2(0,7(;a7)| and 

Cor. 031 


7(777 7*^/^ 

/ |5',,„(0, 777a:)|(/5(a;)—< 1.04488 / wxpiw) 

Jo w Jo 

ip{w)dw. 


IjK 

IIK 


dw 

w 


= 1.04488a: ■ 


70 
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If w > l/K, then wx > xg, and we can use ( |ll.ll| l or (11.141. If g > (xj jd, 
then \Srf 2 {oL,wx)\ < h{x/K )wx by ( 11.14| ; moreover, < h {y)y fo r 

xjK <y< (6qY (by ( |ll.l4 i) and \Srj[{a,y)\ < gy,i{r) fory > (6g)^ (by ( ll.ll| l). 
Thus, Lemmafl 1.2.1 [gives us that 


, dw 
w 


poo dxV poo ^ 

/ \S.n 2 {<^,Wx)\(p{w )—< / h{x / K)wx ■ (p{w)- 

Jl/K w Ji/x 

pOO 

= h{x/K)x / ip(w)dw < h{x/K)\y}\i ■ X. 

JljK 

If g < {xjjd), we always use (11.11 1 . We can use the coarse bound 

[ 3.36a;“^/® • wx ■ (p{w)‘^ < 3.36iT^/®|(/?|ia;®'^® 

Jl/K W 

Since Lr does not depend on x, 

f°° Lr dw Lr. . 

/ - ■ wx ■ ifilw) —<— wux. 

Jl/K r w r 


By Lemma 


11 . 2.2 


and q < {xjKY^^Jd, y is convex-up and decreasing 


fory e [log(a;/f4r), oo). Hence 

\Rx,t 

Therefore 

^OO 


if w > 1. 


/ Rwx,t ■ wx ■ ip{w) - 

Jl/K W 


< 


< 


h/K 


/ logic 

Vlogl 


T^oo/K,! + “ log r ) Rx,t‘f{w)xdw 


where 


X f 

Rx.tX ■ / ip{w)dw + {Rx/K,t - Rx.t)-, - TP / p{w) \ogwdw 

Jl/K yogK 

< (^RxYtIi + {Rx/K,t - -^^■«) log^ ^ ■ 

/ tp{w) log w dw. 

Jl/K 


ip,2,K 


□ 


We finish by proving a couple more lemmas. 


Lemma 11.2.4. Let x > K > 1. Let 77 * = 1/2 *m T’ where 772 L as in \11.10\ and 
(fi : [0, 00 ) —>■ [0, 00 ) is continuous and in L^. Let gx^tp be as in {11.191 
Then gx,cp(r) is a decreasing function of r for 670 < r < {xjK)^'^l6. 
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Proof. Taking derivatives, we can easily see that 


r I—>■ 


log log r 


T I—^ 


logr 


r I—>■ 


log r log log r 


r I—>■ 


(logr)^ log logr 


( 11 . 22 ) 


are decreasing for r > 20. The same is true if log logr is replaced by F{r), since 
F(r )/ log l ogr is a decreasing function for r > e. Since (C'i ^,2 .K/|v3|i)/iogit: < 1 
(by (11.20 1 ), we see that it is enough to prove that r i—>■ Ry^ 2 r log 2r Vlog log r/is 
decreasing onr for y = x and y = x/K (under the assumption that r > 670). 

Looking at (|1 1.13[) and at \\l.22\, we see that it remains only to check that 


r i-> log 1 + 


log 8r 
2 log 

^ ^og 4.008r , 


log 2r ■ 


log log r 


(11.23) 


is decreasing on r for r > 670. Taking logarithms, and then derivatives, we see that we 
have to show that 


1 £ I log 8r 
21'^" 


1 + 

where ^ = log — 




2£ 
1/3 

4.0^ 

1 


1 1 1 
r log 2r 2r log r log log r 2r ’ 


. We multiply by 2r, and see that this is equivalent to 


(2 i+i^) 


log ( 


1 + 


log 8r 
21 


2 

log 2r 


1 


log r log log r 


< 1 . 


(11.24) 


A derivative test is enough to show that s/ log(l + s) is an increasing function of s for 
s > 0; hence, so is s • (2 — 1/(1 + s))/log(l + s). Setting s = (log8r)/£, we obtain 
that the left side of ( |1 1.24[ ) is a decreasing function of £ for r > 1 fixed. 

Since r < I > log 54/4.008 > 2.6. Thus, for (11.241 to hold, it is enough 

to ensure that 


2^6(2 i+ 4 i^) 

log ( 


1 I logSr 

5.2 


+ 


1 


log 2r log r log log r 


< 1 . 


(11.25) 


A derivative test shows that (2 — l/s)/log(l + s) is a decreasing function of s for 
s > 1.23; since log(8 • 75)/5.2 > 1.23, this implies that the left side of (11.25i is a 
decreasing function of r for r > 75. 

We check that the left side of ( |11.25| l is indeed less than 1 for r = 670; we conclude 
that it is less than 1 for all r > 670. 

□ 

Lemma 11.2.5. Let x > 10^®. Let </> : [0, 00 ) —>■ [0, 00 ) be continuous and in L^. Let 
9x,4>{f) ond h{x) be as in {11.19) and \1L15), respectively. Then 


9x,(() 


^4/15 


> h{2x/logx). 











































226 CHAPTER 11. OPTIMIZING AND ADAPTING SMOOTHING FUNCTIONS 


Proof. We can bound gx, 4 ,{r) from below by 

{Rx,r log 2r + 0.5) \/F{r) + 2.5 


gmx{r) = 


■\/2r 


Let r = (3/8)a;'^/^^. Using the assumption that x > 10^®, we see that 

log( 


Rx,r = 0.27125 log 1 + . 




Hog ( 


2.004 


9 ^.^ 1 - 4/15 


). 


0.41415 > 0.63368. 


(11.26) 

(It is easy to see that the left side of (11.26 1 is increasing on x.) Using x > 10^^ again, 
we get that 

Fir) = eC log log r + -- > 5.68721. 

log log r 

Since log2r = (4/15) log a: + log(3/4), we conclude that 

0.40298 log a:+ 3.25765 


grrixir) > 


y^-x2/15 


Recall that 


We can see that 


0.276(loga;)^/^ 1234 log a; 

2;1/6 0^1/3 

(loga: + 3.3)/x2/i5 
(log(2a;/ loga;))3/2/(2a;/ loga;)^/® 


(11.27) 


is increasing for x > 10^^ (and indeed for x > e^^) by taking the logarithm of the 
right side of (11.27 1 and then taking its derivative with respect to f = log a;. We can 
see in the same way that (l/a;^/^®)/(log(2a;/loga;)/(2a;/loga;)^/^) is increasing for 
X > Since 


0.40298(logx + 3.3) 0.276(log(2a:/loga:))^/^ 

\/374-a:2/i5 “ (2a:/log a;) 1/6 

3.25765 - 3.3 • 0.40298 12341og(2a;/log(a:)) 

y^-a;2/i5 “ (2a;/log(x))i/3 

for X = 10^®, we are done. 

□ 


















Chapter 12 

The £2 norm and the large sieve 


Our aim here is to give a bound on the £2 norm of an exponential sum over the minor 
arcs. While we care about an exponential sum in particular, we will prove a result valid 
for all exponential sums S{a, x) = a„e(an) with a„ of prime support. 

We start by adapting ideas from Ramare’s version of the large sieve for primes to 
estimate £2 norms over parts of the circle (i 12.1 1 . We are left with the task of giving 
an explicit bound on the factor in Ramare’s work; this we do in { 12.2 As a side effect, 
this finally gives a fully explicit large sieve for primes that is asymptotically optimal, 
meaning a sieve that does not have a spurious factor of e'’' in front; this was an arguably 
important gap in the literature. 


12.1 Variations on the large sieve for primes 

We are trying to estimate an integral |5'(a) pda. Instead of bounding it trivially by 
I S'! 00 1 S'! 2 , we can use the fact that large (“major”) values of S{a) have to be multiplied 
only by |S'(a)pda, where fOT is a union (small in measure) of major arcs. Now, 
can we give an upper bound for |5'(a) pda better than |>S'(a)pda? 

The first version of IHelbll gave an estimate on that integral using a technique due to 
Heath-Brown, which in turn rests on an inequality of Montgomery’s ( OMonTlI (3.9)]; 
see also, e.g., IIIK04I Lem. 7.15]). The technique was communicated by Heath-Brown 
to the present author, who communicated it to Tao, who used it in his own notable work 
on sums of five primes (see IITaol4l Lem. 4.6] and adjoining comments). We will be 
able to do better than that estimate here. 

The role played by Montgomery’s inequality in Heath-Brown’s method is played 
here by a result of Ramare’s ( IIRam09l Thm. 2.1]; see also IIRam091 Thm. 5.2]). The 
following proposition is based on Ramare’s result, or rather on one possible proof of 
it. Instead of using the result as stated in IIRam09L we will actually be using elements 
of the proof of IIBom741 Thm. 7A], credited to Selberg. Simply integrating Ramare’s 
inequality would give a non-trivial if slightly worse bound. 
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Proposition 12.1.1. Let a„ € C, be supported on the primes. Assume that 

{an} is in l\ n £2 and that a„ = Q for n < ^Jx. Let Qq > 1, 5q > 1 be such that 
^oQo ^ ■ssf Q = a/ xl2So > Qq. Let 


U U (“ 

q^Qo a mod q ' 
{a,q) = l 


Spr a ^ Sqt 
qx ’ q qx 


( 12 . 1 ) 


Let S{a) = J2n ctn^ian) for a G K/Z. Then 



|5'(a)|^ da < 




max max 


GqiQo/sq) 


\q<Qo s<Qo/q Gq(Q/sq) 



where 


Gq{R) = 


E 

r<R 


T\r) 

fir) ' 


(r,9) = l 


Proof. By (12.1 1 , 




E 

a mod q 


(a.q) = l 



Thanks to the last equations of 0Bom74l p. 24] and 0Bom741 p. 25], 


( 12 . 2 ) 


(12.3) 


E 

a mod q 



2 


q-lq X modq* 

(q-,q/qO = l 
fi^(qlqO = t 


2 


'^anXin) 


for every q < y/x, where we use the assumption that n is prime and > \Gc (and thus 
coprime to q) when an f 0. Hence 



|S'(a)|^ da 


£ 0 ^ 

(q\q/q-) = l 
^L^{q|q*) = t 


a„e(an)x(n) 

n 


2 

da 


= E 

q*<Qo 


q* 

fit) 


E 

i'<Qalq* 

(r,9*) = l 


T\r) 

fir) 



T.' 

X modg* 


a„e(Q;n)x(n) 

n 


da 


= E 

q*<Qo 


q* 

fid*) 



E 

r<|0 

(r,q*) = l 


P^jr) 

fir) ^ . 

X mod q 


a„e(an)x(n) 

n 


2 

da 
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Here |a| < ^oQo/ 9 *a; implies {QQ/q)5Q/\a\x > 1. Therefore, 

Gg4Qo/sq*) 

Gq-{Q/sq*) 


/ \S(a)\^ da < I max max 

JOT \g*<Qo s<Qo/q* 


where 


E 


, ^qQq 


Q*<Qo 


(j){q*) J_^o^ 




y^a„e(an)x(n) 


(12.4) 


da 


<y ^ y 

ri\( 


H^{r) 


<^min(l,^) xmod?* 

(r,(j*) = l 

2 


4‘iQ) 4>{f) J-^oQ 


E 


q<Q r<Qlq 

{r,q) = l 


qrx. mod q 

As stated in the proof of 0Bom741 Thm. 7A], 


y^ane[an)x{n) 


da. 


x(r)x(n)r(x)cr(n) = ^ 

h^l 

{h,qr) — l 

for X primitive of modulus q. Here Cr{n) stands for the Ramanujan sum 


C,(n)= y 

u mod r 
{u,r) — l 


For n coprime to r, Cr{n) = p(r). Since x is primitive, |t(x)| = -^/q. Hence, for 
r < yH: coprime to q. 


y^ane{an)x(n) 


qr 


{b,qr) — l 


Thus, 


^ = E E 

q<Q r<Q/q 


(j){rq) 


(r,q) = l 



E’ 

X mod q 


E 


{b,qr) — l 



< E — 



E 

X mod q 


E 

b^l 

{b.q)^l 



2 

da 


= E 

q<Q 



E 

6=1 

(M)=1 


S' 



2 

da. 


2 

da 
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Let us now check that the intervals {b/q — SoQ/qx, b/q + SoQ/qx) do not overlap. 
Since Q = ^xI25q, we see that boQjqx = l/2qQ. The difference between two 
distinct fractions b/q, b'/q' is at least 1/qq'. For q, q' < Q, 1/qq' > l/2gQ + ll2Qq'. 
Hence the intervals around h/q and b'/q' do not overlap. We conclude that 


S</ |^(a)|" = ^ 

JM./'L ^ 

and so, by ( 12.4| >, we are done. 


anp, 


□ 


We will actually use Prop. 12.1.1 in the slightly modihed form given by the follow¬ 
ing statement. 

Proposition 12.1.2. Let {an}^^i, € C, be supported on the primes. Assume that 

{a„} is in l\ n £2 and that a„ = 0 for n < ^/x. Let Qq > 1, i 5n > 1 be such that 
^oQo ^ set Q = ^/xj2fo > Qo- Let 911 = 9115(,^Q|3 be as in {10.5). 

Let S{a) = o-ne{an) for a € K/Z. Then 




|S'(a)| da < max max 


Gg{2Qo/sq) 


q<2Qo s<2Qo/q Gg{2Q/sq) 


E 


a„P, 


where 


Gg{R)= Y. 

r<R 
{r,q) = l 


T\r) 

fir) ■ 


Proof By (fT03]l, 

[ \S{a)f da = E / 
Jm J~ 


q<Qo ’ 

q odd 

+ E 

q<Qo' 

q even 


^qQq 

2qx 

^oQo 

2 qx a mod q 
(a,q) = l 

, '^oQq 

qx a mod q 
{a,q) = l 


S L 


s 


+ a 


da 


da. 


(12.5) 


We proceed as in the proof of Prop, 
equals 


12 . 1.1 


E 

q*<Qo 

g* odd 




, <^oQq 

2q* X 


E 


We still have (12.3 1 . Hence |<S'(q!)| da 
pf{r) 


E 


f(q*) 7_loOo fir) 

2 ,*X StL-) xmodg* 

— q* \ ’ 2|q|x y 


a„e(an)x(n) 


da 


{r,2q‘) = l 


■ E 

q*<2Qo 
q* even 


fiq*) 


. '^0^0 
_ <^oQo 


E 


— R* 




fir) 


E* 

X mod g* 


a„e(an)x(n) 


da. 


{r,q*) = l 
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(The sum with q odd and r even is equal to the first sum; hence the factor of 2 in front.) 
Therefore, 

/ 


|5'(a)| da < max max 


G 2 q- (Qo/sq*) 




9*<Qos<Qo/9* G2q *{ Q / sq *) 

y q* odd 


Gq.{2Qo/sq*) 
q‘<2Qo s<2Qo}q* Gq»{2Q/sq*) 

q* even 


max max 


2Ei 


•S 2 , 


( 12 . 6 ) 


where 




9<Q 
q odd 


</>(?) 


E 

r<Qlq 
(r,2g) = l 


= y^ y 


q<Q 

q odd 




q<2Q 
q even 




r<2Q/q 
(r,q) = l 
r even 

E 

r<2Qlq 

{r,q)^l 


4>{r) 

yy 

4>{r) 


2qj'x _—^:4c 

■SqQ ^ 

2grx X mod q 


,^oQ 


E‘ 

X mod q 




4>{r) J_§oQ 


E’ 

X modg 


yane{an)xin) 

n 

yane{an)x{n) 

n 

y^a„e(an)x(n-) 


da 


da. 


da. 


The two expressions within parentheses in (12.6 1 are actually equal. 
Much as before, using IIBom741 Thm. 7A], we obtain that 


Si 


si<E 

q<Q 

q odd 

S2< ^ 


1 


, <SqQ 

2qx 


(biq) I ^oQ 

g odd {b,q) — l 

^ SqQ q 


E 

.q)- 

q 

E 


51 -+a 


q<2Q 
q even 


(j){q) J_^_oQ 

qx 0—L 

{b,q) = l 


S[-+a 


da. 


da. 


Let us now check that the intervals of integration {b/q — 6QQl2qx, b/q + SoQ/2qx) 
(for q odd), {b/q — SoQ/qx, b/q + 5oQ/qx) (for q even) do not overlap. Recall that 
SoQ/qx = l/2qQ. The absolute value of the difference between two distinct fractions 
b/q, b'/q' is at least 1/qq'. For q,q' < Q odd, this is larger than 1/IqQ + \/AQq', 
and so the intervals do not overlap. For q < Q odd and q' < 2Q even (or vice versa), 
1 / 77 ^ > 1 / 49(3 + again the intervals do not overlap. If q < Q 

and 9 ' < (3 are both even, then \b/q — b'/q'\ is actually > 2 / 99 '. Clearly, 2/qq' > 
1 / 29(3 + 1/2(39/ so again there is no overlap. We conclude that 

2Ei + E2< f \S{a)f=y\ay. 


□ 
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12.2 Bounding the quotient in the large sieve for primes 

The estimate given by Proposition [TzTTTJinvolves the quotient 


GgiQo/sq) 
q"<Qoe<Qo}q Gq{Q/sq) 


max max 


(12.7) 


where Gq is as in ( 12.2|i. The appearance of such a quotient (at least for s = 1) 
is typical of Ramare’s version of the large sieve for primes; see, e.g., IIRam09l . We 
will see how to bound such a quotient in a way that is essentially optimal, not just 
asymptotically, but also in the ranges that are most relevant to us. (This includes, for 
example, Qq ^ 10®, Q ~ 10^®.) 

As the present work shows, an approach based on Ramare’s work gives bounds that 
are, in some contexts, better than those of other large sieves for primes by a constant 
factor (approaching e'*' = 1.78107...). Thus, giving a fully explicit and nearly optimal 
bound for (12 . 7[ ) is a task of clear general relevance, besides being needed for our main 
goal. 

We will obtain bounds for Gq{Qo/sq)/Gq{Q/sq) when Qo ^ 2 • 10^®, Q > Qq. 
As we shall see, our bounds will be best when s = g = 1 - or, sometimes, when s = 1 
and <7 = 2 instead. 

Write G{R) for Gi{R) = J2r<R need several estimates for 

Gq{R) and G{R). As stated in IIRam95l Lemma 3.4], 


G{R) < logi7+ 1.4709 
for i? > 1. By 0MV73I Lem. 7], 

G{R) > log i?+ 1.07 
for R> 6. There is also the trivial bound 


( 12 . 8 ) 


(12.9) 






r<R 


(j){ 


r<R 


p\r 


r<R 


p\r r<R 


( 12 . 10 ) 


The following bound, also well-known and easy. 


G{R) < ^fqiR) < G{Rq), 


( 12 . 11 ) 


can be obtained by multiplying Gq{R) = J2r<R {r q)=i i''") / term-by-term by 

7/</>(7) = np|g(l + !/'('(?))■ 

We will also use Ramare’s estimate from IIRam951 Lem. 3.4]: 


GdiR) = 


(j){d) 


log 77 -I- c_E+ 

p\d 


\ogp 


+ 0* (7.28477-^/3/1 (d)) (12.12) 
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for all d £ Z+ and all i? > 1, where 


Md) = Y[{l+p-^/^)(l + 


p\d 


pl/3 + p^/3 

pip - 1 ) 


and 


ce 




p>2 


logp 

pip - 1 ) 


= 1.3325822... 


(12.13) 


(12.14) 


by llRS^ (2.11)]. 

If > 182, then 


log R + 1.312 < G(i?) < log i? + 1.354, 


(12.15) 


where the upper b ound is valid for R > 120. This is true by ( |l2.12| i for R > 4 ■ 10^; 
we check (12.15 1 for 120 < i? < 4 • 10^ by a numerical computationj^ Similarly, for 
R > 200, 


log+ 1.661 ^ < log j^+ 1.698 


(12.16) 


by (12.12 1 for i? > 1.6 • 10®, and by a numerical computation for 200 < i? < 1.6 • 10®. 
Write p = (log Qo)/(log Q) < 1. We obtain immediately from (|12.15[) and (|12.16| 


that 


GjQo) ^ logQo +1.354 
GiQ) - logQ +1.312 
G2iQo) ^ logQo + 1-698 


(12.17) 


G 2 iQ) log (5 +1.661 

for Q, Qq > 200. What is hard is to approximate GqiQo)/GqiQ) for q large and Qq 
small. 

Let us start by giving an easy bound, off from the truth by a factor of about e'’'. 
(Specialists will recognize this as a factor that appears often in first attempts at esti¬ 
mates based on either large or small sieves.) First, we need a simple explicit lemma. 


Lemma 12.2.1. Let m > 1, q > 1. Then 

n " 


p-1 


< e’^(log(m + \ogq) + 0.65771). 


(12.18) 


p\q\/p<m 

Proof. Let = Up<„,yp\qP- Then, by l|RS75l (5.1)], 

^<qY[p = 

p<m 

where cq = 0.001102. Now, by liRS62] (3.42)], 

n , 2.50637 , 2.50637 

< e ' log log n + -—^- < e ' log log x + 


fin) 


log log n 


log log X 


^Using D. Platt’s implementation IPlalll of double-precision interval arithmetic based on Lambov’s 
ILamOSI ideas. 
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for all a; > n > 27 (since, given a,b>0, the function t 1 —>■ a + b/t is increasing on t 
for t > ^fbja). Hence, if ge"* > 27, 

^ ^ ^ , N 2.50637 

< log((l + eojm + log q) + 


log(TO + log q) 

^ , 2.50637/eT' \ 

< e * log TO + logg) + eo + — ,■ -- —r • 
V log(TO + log g )} 


Thus (I 2 .I 81 holds when to + logg > 8.53, since then eg + (2.50637/e'’')/log(TO + 
logg) < 0.65771. We verify all choices of TO,g > 1 with to + logg < 8.53 compu¬ 
tationally; the worst case is that of to = 1, g = 6, which give the value 0.65771 in 
( [TTT8 ] i. □ 

Here is the promised easy bound. 

Lemma 12.2.2. Let Qo > 1. Q > 182(5o- 1 < Qo. s < Qgjq, g an integer. Then 

G,iQo/sq) ^ enog(^+logg)+1.172 ^ log+ 1.172 


GqiQ/sq) 


log ^ + 1.312 


< 


Qo ^ Qo 

Proof. Let = [Ip^Qo/^avplgP- Then 

GqiQg/Sq)G^{Q/Qg) < Gq{Q/sq) 

and so 


log ST+ 1-312 


< 


1 


Gq{Qg/sq) 

Gq{Q/sq) G^{QIQq) 


(12.19) 


Now the lower bound in (12.11 1 gives us that, for d = R — Q/Qg, 

GAQIQo) > ^^GiQIQg). 


By Lem. 12.2.1 




< 


(log( 


Qo 

sq 


+ logg + 0.658 


Hence, using ( 12.15| l, we get that 

Gq{Qg/sq) ^ ^ enog(^+logg) +1. 


172 


GqiQIsq) - G{QIQg) 
since QjQg > 184. Since 


Qo 


< 


log ^ + 1.312 


( 12 . 20 ) 


sq 


Qo 1 1 


^+logg =- 2 iL + - = - 1_+L <0, 


sg^ g g 


Qo 


sq 


the rightmost expression of P 2 . 20 |l is maximal for g = 1 . 


□ 
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Lemma 12.2.2| will play a crucial role in reducing to a finite computation the prob¬ 
lem of bounding Gq{Qo/ sq) /Gq{Q / sq). As we will now see, we can use Lemma 
|12.2.2| to obtain a bound that is useful when sq is large compared to Qo - precisely the 
case in which asymptotic estimates such as \\ 2 .\ 2 \ are relatively weak. 

Lemma 12.2.3. Let Qo > I, Q > 200Qo- Tet q < Qo> s < Qo/q- Let p = 
(logQo)/ logQ < 2/3. Then, for any a > 1.312p, 


GgiQo/sq) ^ \ogQo + a 


holds provided that 


GqiQ/sq) logQ +1-312 


— <c[a)-Q\ -logg, 

sq 


( 12 . 21 ) 


where c(a) = exp(exp(—7) • (a — ct^/5.248 — 1.172)). 

Proof. By Lemma [l2.2.2| we see that \\2.2\) will hold provided that 


eT'logf —+logg) + 1.172 < 


V sq 




1.312 


logg + 1.312 


(logQo +cr). 


( 12 . 22 ) 


The expression on the right of p2.22| l equals 

(log Qo + 0-) log Qo 


logQo 


logQ +1.312 

= (l-p)(logQo + o-) + 


1.312p(logQo + cr) 


log <3 

> (1 - p)(log(3o + cr) + 1.312p^ 


1.312 


and so \\2.22\ will hold provided that 

log + logg^ + 1.172 < (1 - /o)(log(3o) + (1 - p)^ + 1.312p^. 
Taking derivatives, we see that 

(1 - p). + 1.312,= - 1.172 > (1 - 2:^) -11.312 


- 1.172 


Hence it is enough that 

— + logg < ^ ^ 

sq 

where c{a) = exp(exp(—7) • (cr — cr^/5.248 — 1.172)). □ 

We now pass to the main result of the section. 
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Proposition 12.2.4. Let Q > 2 OOOOQ 0 . Qo > Qo.min, where Qo.min = 10^. Let 
p = (log Qo)/ log <5- Assume p < 0.6. Then, for every 1 < g < Qo cind every 
s e [l,Qo/g], 


Gq{Qo/sq) ^ logQo + C+ 


(12.23) 


Gq{Q/sq) logQ + CB’ 
where ce is as in { 12.14\ and c+ = 1.36. 

An ideal result would have c+ instead of ce, but this is not actually possible: error 
terms do exist, even if they are in reality smaller than the bound given in (12 . 12^ ; this 
means that a bound such as (12.23 1 with c+ instead of ce would be false for <7 = 1, 
s = 1. 

There is nothing special about the assumptions 

Q> 20000Qo, Qo>10^ (logQo)/(logQ) < 0.6. 

They can all be relaxed at the cost of an increase in c_|_. 

Proof. Define err^ so that 

logp 


Gq{R) = 


Hfl) 


\0gR + CE + Y^ 


p\q 


P 


+ eiTq^n ■ 


(12.24) 


Then ( fTZl^ will hold if 

, Qo , , logp 

log - +CE+ y - 

sq ^ P 

p\g 


</(<?) 



fiq) f 


log Qo + c+ 
log Q + ce ' 


(12.25) 


This, in turn, happens if 
log sq-Y^ 


\ogp 


1 - 


p\q 


log Qo + c+ 
log Q + C_E 


+ c+- Ce 


> 


'^{q) 


log Qo + c+ 

err Qo - . ^ , err q 

=9 logQ + Cs ‘^’=9 


Define 


^(P) = T 


log Qo ,mm C-| 


log Qo,i 


Ce 


= P- 


C+ — PCE 


-p log Qo.i, 


Ce 


Then p < (log Qo + c+)/ (log Q + ce) < w(p) (because c+ > pce)- We conclude that 
( |12.25| l (and hence ( |12.23| l) holds provided that 


(1 - u{p)) I log sq-Y I + 


CA 


p\q 


(12.26) 


> 




(eiT^ Qo +w(p) max (^0,-err_j_^^^ , 
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where ca = c+ — ce- Note that 1 — uj{p) >0. 

First, let us give some easy bounds on the error terms; these bounds will yield upper 
bounds for s. By ( |12.8| l and ( |12.11[ ), 


err. 


^ H<}) ( , logp 

.it< — llog7-2^ — 

p\q 


(1.4709- cb) 


for R> 1; by ( |12.15 1 and (12.11 1 , 

err > 


+ 1.312) 


, P\<1 


for R > 182. Therefore, the right side of (12.26 1 is at most 

logg - (1 - w(p))^ + ((1.4709 - ce) + w(p)(c£; - 1.312)), 

V P 

p\q 

and so ( |12.26[ ) holds provided that 

(1 - w(p)) logsg > logg + (1.4709 - ce) + uj{p){ce - 1.312) - ca- (12.27) 

We will thus be able to assume from now on that \\2.21\ does not hold, or, what is the 
same, that 

sq < ( Cp, 2 g) (12.28) 

holds, where Cp _2 = exp((1.4709 — ce) + uj{p){ce — 1.312) — ca). 

What values of i? = Qo/sq must we consider for g given? First, by (12.281, we 
' R > Qo,min/(cp, 2 g)^^^^~“^'’^^- We Can also assume 


can assume 


R > c(c+) • max(i?g, (5o,min)^^ - logg 


(12.29) 


for c(c+) is as in Lemma 12.2.3 since all smaller R are covered by that Lemma. 
Clearly, ( |12.29| l implies that 


R' 


> c(c+) • g^ - 


logg 

R^ 


> c(c+)g^ - logg, 


where r = (1 — p)e and also that R > — log g. Iterating, we obtain 

that we can assume that R > w{q), where 


w{q) = max tx 7 o(g),c(c+)Q 5 - logg, 


Qo,r 


(cp, 2 g) 


(12.30) 


and 


tx7o(g) = 


_ J c(c+)g^ - 


logg 


(c(c+)9^-logq) 1- 


0 


ifc(c+)g^ > logg + 1 , 
otherwise. 
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R > vj{q), we have 


Looking at ( |12.26[ ), we see that it will be enough to show that, for all R satisfying 

(12.31) 


+w(p) max (0, - err,_tij) < 


<^(9) 


9 


K{q) 


for all t > 20000, where 


k( 9) = (1 - w(p)) I log9-XI I 
V p|9 / 

Ramare’s bound ( 12.12} implies that 

I err,,i^ | < 7.284i?"^/^/i(g), 

with fi{q) as in ( |12.13 1, and so 

errq,_R+a;(p)max(0,-errq^tfl) < (l + /3p) • 7.284i?“^/^/i(g), 
where /3p = a;(p)/20000^/^. This is enough when 


(12.32) 


R > Hq) = 


q 7.284(1 + /3p)/i(g) 




K{q) 


(12.33) 


It remains to do two things. First, we have to compute how large q has to be for 
w{q) to be guaranteed to be greater than A(g). (For such q, there is no checking to be 
done.) Then, we check the inequality ( |12.31| l for all smaller q, letting R range through 
the integers in \w{q), A(g)]. We bound eiiq^tR using (12.32i, but we compute err^/j 
directly. 

How large must q be for w{q) > X{q) to hold? We claim that w{q) > \{q) 
whenever q > 2.2 • 10^°. Let us show this. 

It is easy to see that (p/(p— 1)) • /i(p) andp —>■ (logp)/p are decreasing functions 
of p for p > 3; moreover, for both functions, the value at p > 7 is smaller than for 
p = 2. Hence, we have that, for q < np<po P’ P^ ^ prime. 


c(g) > (1 -a;(p)) (logq- X 

\ p<po ^ ) 


CA 


(12.34) 


and 


•^(9) < n 


7.284(1 +/3p)np<p„/i(F) 


VP<P0 ^ ^ (1 - w(p)) (log 9 - E 

If we also assume that 2 • 3 • 5 • 7 -j" g, we obtain 

f 

K(q) > (1 - uj(p)) logg- X 

P<P0 
\ p#7 


logP 
P<Po p 


(12.35) 


CA 


logp 


H“ ca 


(12.36) 
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and 


Hq) < 


( 


n 

P<PO 

\p^7 


P 

p-1 


7.284(l + ft)n,<,„j^7/i(p) 


(12.37) 


for q < np<po ■ (We are taking out 7 because it is the “least helpful” prime to omit 
among all primes from 2 to 7, again by the fact that {p/{p — 1)) • fi{p) and p 
{logp)/p are decreasing functions forp > 3.) 

We know how to give upper bounds for the expression on the right of ( |12.35[ ). 
The task is in essence simple: we can base our bounds on the classic explicit work in 
IRS62I . except that we also have to optimize matters so that they are close to tight for 
Pi = 29, Pi = 31 and other low pi. 

By IIRS62I (3.30)] and a numerical computation for 29 < pi < 43, 


n 

p<pi 


p 

p-1 


< 1.90516 logpi 


for Pi > 29. Since uj{p) is increasing on p and we are assuming p < 0.6, Qo.min = 

100000, 


uj{p) < 0.627312, Pp < 0.023111. 

For X > a, where a > 1 is any constant, we obviously have 

n-2/3 


log(l+p < Y (logp) 


P 


adpKx 


a<.p<x 


logo 


by Abel summation (13.3 i and the estimate 0RS621 (3.32)] for 0{x) = X] 


p<x 


Y (logFb = {0{x) - 0{a))x 3- [ {9{u) - e{a)) (-‘^u 3 


a<.p<x 


logp, 

) du 


2 

< (1.01624a: — 0(a))a;“3 + - / {l.QlQ2Iu — 0{a)) u~idu 

^ J a 

= (1.01624a; - 6»(a))a:"5 + 2 • 1.01624(x^/^ - + 6»(a)(a:"^/^ - 

= 3 • 1.01624 • x^^^ - (2.03248a^/^ + 0 ( 0 ) 0 "^/^). 


We conclude that J2io^<p<x log(l + P < 0.33102a:^/^ — 7.06909 for x > 10'*. 
Since X]p<io3 — 10.09062, this means that 

10.09062- 7.06909 \ n,, l/c, 

Y log(l + F” ^ ) < ( 0.33102 + j a;*/^ < 0.47126a;*/^ 

p<x ^ '' 

for X > 10*; a direct computation for all x prime between 29 and 10* then confirms 
that 

Y log(l + < 0.74914a;*/3 

p^x 
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for all X > 29. Thus, 

n - 


iog(i+p 


p<x 


np<29 ( 


1 + 


pl/3_|_p2/3 \ 
p(p-l) ) 


< 


p0.74914a:^i/^ 

6.62365 


for a; > 29. Finally, by l|RS62l (3.24)], < logpi. 

We conclude that, for q < np<po P^’ Po ^ Pi the prime immediately 

preceding pq, 


I 


H<i) < 


7.45235 • 


1.90516 logpi 


^0.74914p}/^ 

6.62365 


\ 


< 


V 

190.272(logpi)3e2-24742p 

(logg-logpi +0.07354)3 ■ 


0.37268(logg - logpi) + 0.02741 


/ 


(12.38) 


1/3 


It is clear from (12.301 that w{q) is increasing as soon as 


q ^ 13iax((5o,min 7 Q\ 


l-ui{p) 
min, S:ro,min 


/cp.2) 


and c(c+)g'^ > log g + 1, since then vuo{q) is increasing and vu{q) = zuo{q). Here it is 
useful to recall that Cp,2 > exp(1.4709 — c+), and to note that c(c+)q'^ — (logg + 1) 
is increasing for q > 1/(t • c(c+))t/"'; we see also that l/(r • c(c+))t/'^ < 1/((1— 

0.6)e“7c(c+))t/((t-O.6)e ~') Jqj. 

p < 0.6. A quick computation for our value of C-|- 
makes us conclude that q > 1.12(5o.min = 112000 is a sufficient condition for vu{q) to 
be equal to vuo{q) and for vuo{q) to be increasing. 

Since (12.381 is decreasing on q for pi fixed, and vao{q) is decreasing on p and 
increasing on q, we set p = 0.6 and check that then 


whereas, by ( |12.38| l. 


Wo (2.2 • 10^°) > 846.765, 


A(2.2 • 10^'^) < 838.227 < 846.765; 


this is enough to ensure that \{q) < wo{q) for 2.2 • 10 10 <g< 

ripest p- 

Let us now give some rough bounds that will be enough to cover the case q > 
ripoiP. First, as we already discussed, w{q) = wo{q) and, since c(c+)g'^ > logg + 
1 , 


^oiq) > (c(c+)9’’ — log9)1-^ > (O.Ollg®'^^"^ — logg)^’^®® > gO.2797 


(12.39) 


by <? > np<3iP- We are in the range llp<p,P^Q < Ilp^poP’ where pi < po 
are two consecutive primes with pi > 31. By IIRS62I (3.16)] and a computation for 
31 < 9 < 200, we know that logg > np<pi logP ^ 0.8009pi. By (12.381 and 
(|12.39|l, it follows that we just have to show that 


g0.224t ^ 


190.272(logf)3e2'24742+i" 


(0.8009f-logf +0.07354)3 
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for t > 31. Now, f > 31 implies 0.8009f — logf + 0.07354 > 0.6924f, and so, taking 
logarithms we see that we just have to verify 

0 . 224 f - 2.24742fi/3 > slogbgf - 31ogf + 6.3513 (12.40) 


for f > 31, and, since the left side is increasing and the right side is decreasing for 
t > 31, this is trivial to check. 

We conclude that w{q) > X{q) whenever q > 2.2 • 10^°. 

It remains to see how we can relax this assumption if we assume that 2 • 3 • 5 • 71 g. 
We repeat the same analysis as before, using P2.36| l and (|12.37|) instead of (|12.34|l and 
( |l235] l. For Pi > 29, 


n 

p<pi 

p^7 


P 

P- 1 


< 1.633 log Pi, 


n - 

p<pi 

p^7 


g0.74914a:i''^-log(l + 7-^^^) 


g0.74914a:i/^ 

- 7.44586 


andEp<pi:p/7(logF)/P < logpi- (log7)/7. So,forg < np<po:p547F’andpi > 29 
the prime immediately preceding pg. 


/ 


H<i) < 


7.45235 


1.633 log Pi 


^0.74914py^ 

7.44586 


V 


0.37268 


g-logpi + + 


0.02741 


^ 84.351(logpi)3e2-24742p;''" 

- (logg-logpi +0.35152)3’ 


Thus we obtain, just like before, that 

tt7o(3.3 • 10^) > 477.465, A(3.3 • 10^) < 475.513 < 477.465. 


We also check that ■ci7o(go) > 916.322 is greater than A(go) < 429.731 for go = 
ripoi p^rP- The analysis for g > Hpor-p^^rP also just like before; since logg > 
0.8009pi — log 7, we have to show that 

g0.224t ^ 84.351(logf)^e^'^"‘’^‘^^‘^^^ 

7 ^ (0.8009f-log 1 + 0.07354)3 

for t > 37, and that, in turn, follows from 

0.2241 - 2.247421^/3 > 3 log log 1 - 3 log 1 + 6.74849, 


which we check for 1 > 37 just as we checked ( |12.40| i. 

We conclude that zu{q) > A(g) if g > 3.3 • 10^ and 2101 g. 

Computation. Now,forg < 3.3-10® (and also for 3.3-10® < g < 2.2-103®, 210|g), 
we need to check that the maximum of err^ over all vj{q) < R < A(g) 

satishes (12.311. Note that there is a term eiiq^tR in < 12.31 1 ; we bound it using ( 12.32||. 

Since log R is increasing on R and Gq{R) depends only on [i?J, we can tell from 
( 12.24|l that, since we are taking the maximum of err^ tj, it is enough to check integer 
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values of R. We check all integers R in A(q)) for all q < 3.3 • 10® (and all 

3.3 • 10® < q <2.2 ■ 10^®, 2 IOI 9 ) by an explicit computation|^ □ 


Finally, we have the trivial bound 

GqiQo/sq) 
GqiQ/sq) 

which we shall use for Qq close to Q. 


< 1 , 


(12.41) 


Corollary 12.2.5. Let an S C, be supported on the primes. Assume that 

{o„} is in £i n £2 and that a„ = 0 for n < ^Jx. Let Qq > 10®, 6q > 1 be such that 

(20000Qo)^ < x/25q; set Q = ^JxJWq. 

Let S{a) = "^n ane{an) for a G K/Z. Let 9Jl as in {12.11 . Then, if Qq < Q® ®, 

Jm logQ + cs 


where c+ = 1.36 and ce = 1 + /(f’(F ~ 1)) = 1.3325822 .... 

Let^So,Qo iti {10.5). Then, if {2Qq) < (2Q)® ®, 


'®J5o,Qo 


' ^ - log2Q + Ci5 


(12.42) 


Here, of course, da = J^n (Plancherel). If Qq > Q®'®, we will 

use the trivial bound 



|S'(a)|^ da < 


|5'(a)|^ da = y |a„|®. 

n 


(12.43) 


Proof. Immediate from Prop. 12.1.1[ Prop. 12.1.2 and Prop. 12.2.4 


□ 


Obviously, one can also give a statement derived from Prop. |12.1.1[ the resulting 
bound is 



\S{a)\'^da < 


logQo + c+ I |2 


where is as in 

We also record the large-sieve form of the result. 


^This is by far the heaviest computation in the present work, though it is still rather minor (about two 
weeks of computing on a single core of a fairly new (2010) desktop computer carrying out other tasks as well; 
this is next to nothing compared to the computations in Eib), or even those in IHP131 ). For the applications 
here, we could have assumed p < 8/15, and that would have reduced computation time drastically; the 
lighter assumption p < 0.6 was made with views to general applicability in the future. As elsewhere in this 
section, numerical computations were carried out by the author in C; all floating-point operations used D. 
Platt’s interval arithmetic package. 
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Corollary 12.2.6. Let N > 1. Let a„ G C, be supported on the integers 

n < N. Let Qg > 10®, Q > 20000(5o- Assume that an = 0/or every nfor which 
there is a p < Q dividing n. 

Let S{a) = arae(ckn) for a € M/Z. Then, ifQo < <5° ®, 

Q^Qo o. mod g h 

{a,q) = l 


where c+ = 1.36 and ce = "f + Sp> 2 (^®SP)/(p(^' ~ 1)) “ 1-3325822 .... 

Proof. Proceed as Ramare does in the proof of IIRam09l Thm. 5.2], with Jlfq = {a G 
'LjqL : (a, <f) = 1} and = a„); in particular, apply IIRam091 Thm. 2.1]. The proof 
of IIRam091 Thm. 5.2] shows that 

^ |5'(a/g)|^da < max ^ ^ \S{alq)fda. 

- A QSQo '~rq[LJ) ‘—f -' 

QSwo ^ mod q ^ / qSQo ^ mod q 

{a,q) = l (a,q) = t 

Now, instead of using the easy inequality Gq{Qo)/Gq{Q) < Gi{Qo)/Gi{Q/Qo), use 

Prop. |12.2.4| □ 



* * * 


It would seem desirable to prove a result such as Prop. 12.2.4 (or Cor. 12.2.5 


or 

Cor. |12.2.6| l without computations and with conditions that are as weak as possible. 
Since, as we said, we cannot make c+ equal to cb, and since c+ does have to increase 
when the conditions are weakened (as is shown by computations; this is not an arti¬ 
fact of our method of proof) the right goal might be to show that the maximum of 
Gq{Qo/sq)/Gq{Q/sq) is reached when s = q = 1. 

However, this is also untrue without conditions. For instance, for Qo = 2 and Q 
large, the value of Gq{Qo/q)/Gq{Q/q) at g = 2 is larger than at g = 1: by (12.12i, 


(t) 1 

^2 5 (log § + CB + 

2 2 GjQo) 

logQ + c_E — logQ + CE G{Q) 


Thus, at the very least, a lower bound on Qq is needed as a condition. This also dims 
the hopes somewhat for a combinatorial proof of Gq{Qo/q)G{Q) < Gq{Q/q)G{Qo)', 
at any rate, while such a proof would be welcome, it could not be extremely straightfor¬ 
ward, since there are terms in Gq{Qo/q)G{Q) that do not appear in Gq{Q / q)G{Qf). 
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Chapter 13 

The integral over the minor arcs 


The time has come to bound the part of our triple-product integral (10.3 1 that comes 
from the minor arcs m C K/Z. We have an estimate (from Prop. 11.2.3 based on 
Theorem 3.1.1 1 and an £2 estimate (from (12.2 1 . Now we must put them together. 

There are two ways in which we must be careful. A trivial bound of the form 
£3 = J |5'(Q;)pcia < £2 ■ £00 would introduce a fatal factor of logx coming from £2- 
We avoid this by using the fact that we have £2 estimates over dJlsg,Qg for varying Qo- 

We must also remember to substract the major-arc contribution from our estimate 
for Qj,; this is why we were careful to give a lower bound in Lem. 10.3.1 as 
opposed to just the upper bound (|10.28|l. 


13.1 Putting together £2 bounds over arcs and £00 bounds 

Let us start with a simple lemma - essentially a way to obtain upper bounds by means 
of summation by parts. 

Lemma 13.1.1. Let /, g : {a, a -I- 1, ..., &} —>■ IR(|, where a,b G Z"*". Assume that, for 
all X G [a, b], 

fin)<F{x), (13.1) 

a<n<x 

where F : [a, 6] —>■ M. is continuous, piecewise differentiable and non-decreasing. Then 
b 

f{n) ■ g{n) < (maxp(n)) • F{a) + 

n>a 

Proof. Let S{n) = /(^)- Then, by partial summation, 

b b-1 

Y ^ >5'(6)5(6) -f Y 'S'WCsH - 9{n + 1)). (13.2) 



(maxp(n)) 

n>u 


F'{u)d'h 
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Let h{x) 
that 


maxa;<„<b g(n). Then h is non-increasing. Hence (13.1 1 and (13.21 imply 

b b 

^ H f{n)h{n) 

n—a n—a 

6-1 

< S{b)h{b) + S{n){h{n) — h{n + 1)) 

n—a 

6-1 

< F{b)h{b) + ^ F{n){h{n) — h(n + 1)). 


In general, for G C, A(x) = 'Ylia<n<x ^ continuous and piecewise differ¬ 

entiable on [a, x], 


anF{x) = A{x)F{x) 

a<.n<.x 


A{u)F' {u)du. 


{Abel summation) (13.3) 


Applying this with = h{n)-h(n + T) and A{x) = J2a<n<x = h{a)-h{[x\ + 
1 ), we obtain 


b-l 


E^( n){h{n) — h{n + 1)) 

n—a 

pb-1 

= {h{a) - h(b))F{b - 1) - / {h{a) - h{[u\ + l))F\u)dt 

J a 

nb—l 

= h(a)F{a) - h{b)F{b - 1) + / h{[u\ + l)F'{u)du 

J a 
pb-1 

= h{a)F{a)-h{b)F{h-l)A / h{u)F'{u)du 

J a 

= h{a)F{a) - h{b)F{b) + j h{u)F'{u)du, 

J a 


since h(\u\ -f 1) = h{u) for rt ^ Z. Hence 

b 


E/ — h{a)F{a) + / h{u)F'{u)du. 


□ 


We will now see our main application of Lemma 13.1.1 We have to bound an 
integral of the form | 5 'i(a)p| 5 ' 2 (a)|(ia, where is a union of arcs defined 

asin(10.5l. Our inputs are (a) a bound on integrals of the form |5'i(a)p(iQ;, (b) 

a bound on | 5 ' 2 (Q!)| fora G (K/Z) \ The input of type (a) is what we derived in 

^| 12 . 1 |and ^12.2\ the input of type (b) is a minor-arcs bound, and as such was the main 


subject of Parth 
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Proposition 13.1.2. Let S'! (a) = J2n «ne(an), Un G C, {an} in L^. Let S 2 : R/Z —>■ 
C be continuous. Define £Olso,r as in ( |7Q.5 ). 

Let vq be a positive integer not greater than ri. Let H : [ro,ri] —^ M"*" be a 
continuous, piecewise differentiable, non-decreasing function such that 

\Siia)\^da< H{r) (13.4) 

,r+l 

for some Jq < x/2r\ and all r G [ro,ri]. Assume, moreover, that H{ri) = 1. Lef 
9 '■ [ro, ri] —>■ be a non-increasing function such that 

max |S' 2 (a)| < g(r) (13.5) 

ae(R/Z)\OT5n,J 



for all r G [ro, fi] and Sq as above. 
Then 

1 f 

EM 


|5'i(a)|^|S'2(a)|dQ; 




(13.6) 


where 


< giro) ■ {H{ro) - Jq) + f gir)H'{r)dr, 

J ro 

^ ^ \Si{a)\^da. (13.7) 


lo = 


J 2 n Jms^ 


The condition Sq < xj2ri is there just to ensure that the arcs in the definition of 
£fftSf,,r do not overlap for r < ri. 

Proof. For tq < r < ri, let 


/w = 12 


EJ« 




Let 


fi^i) = 


\Siia)\^da. 


\Siia)\'^da. 


Then, by (13.5 1 , 


En 7(R/Z)\OTa, 


\Siia)\‘^\S 2 ia)\da < ^ /(r)p(r). 


By ( [13:41 1, 

E = ^ 


ro<r<a: 


a. 


n\ JmSg,:c + l\'XlSg,rg 

1 


\Si{a)\'^da 


EnK 


f \Siia)f da\ - To < H{x) - To 

'^'^Sq,x + 1 j 


(13.8) 
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fora; G [ro,ri). Moreover, 


I] /W = 

ro<r<ri 


1 


/ 1 


'(R/Z)\OTio 


|5i(a)p 




[ \S,ia)\A -Io = l-Io = H{r,)-Io. 
JR/Z J 


We let F(x) = H(x) — Iq and apply Lemma 13.1.1 with a = xq, b = ri. We 
obtain that 

' rri 

Y] f{i') 9 {r) < {rnaxg{r))F{ro) + / (maxg(r))F'(u) du 

r>ro r>u 

r—ro ' 0 

< 9 {ro){H(ro) - lo) + ( g(u)H'(u) du. 

J rn 


□ 


13.2 The minor-arc total 


We now apply Prop. 13.1.2 Inevitably, the main statement involves some integrals that 
will have to be evaluated at the end of the section. 


Theorem 13.2.1. Let x > 10^^ • >c, where >c>l. Let 

Srfia,x) = E A(n)e(an)ri(n/x). 


(13.9) 


Let ? 7 *(f) = (r ]2 *m T)i^)y where 772 in [ILIO) and ip : [0, 00 ) —>■ [0, 00 ) is 
continuous and in £^. Let 77 + : [0, 00 ) —^ [0, 00 ) be a bounded, piecewise differentiable 
function with limt_>.oo il+(t) = 0. Let Hfflsg^r be as in (10.5 \ with Jq = 8 . Let 10® < 
ro < ri, where ri = (3/8)(x/>f)"‘/^®. Let g(r) = 9x1 where 


gy^^(r) = ^Y,K,iP. 2 r log 2 r ^.5) + % + 3.36^1/(13- 


'jTr 


10 ) 


just as in (11.19), and K = \og(x/if)/2. Here Ry^K,rj>,t is as in (11.19), and Lt is as 
in (TTl3\ . 


Denote 


Zro — 


/(R/Z)\OT8,, 


15^,, (a, a;)115^,^ (a, x)\^da. 
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where 


*5'= XI 


P>y/x 


T = C^.3 ( 

llog^^.(5-(v77-V^)2), 

J= 1 

|5',;^(a,a;)|^ da, 

•CWts.r-O 



E — + (^^+, 2 ) log X + (2C'^^_o + Cr;+.l)) ’ 1 

1 

(^ 7 ,+ ,0 = 0.7131 / —^ {sup r]+{r))^dt, 

Jo yt r>t 

(^7,+ ,1 = 0.7131 / i^(sup?7+(r))^dt, 

Jl VC r>t 

C,^.2 = 0 . 51942 | r ;+|^, 

1 04488 7^/^ 

C^,,{K) = ^^J^ \^{w)\dw 


(13.11) 


(13.12) 




+ 


9{r) 


hgx + 2c- 
where c+ = 2.0532 and c_ = 0.6394. 


dr + — + 


7 —2.14938 + ^ log XT 


15 


log a; + 2 c~ 


5(ci) 


5 

(13.13) 


Proof. Let y — xjx. Let Q = ( 3 / 4 ) 2 /^/^, as in Thm. 


3.1.1 


(applied with y instead 


of x). Let a G (R/Z) \ Olts.r, where rg < r < and y is used instead of 

X to dehne fOls^r (see ( |10.5 1). There exists an approximation 2a = ajq + Sjy with 
9 Q, l^l/y < l/^Q- Thus, a = a'/q' + 6/2y, where either a'/q' = a/2q or 
o-'/ 9 ' = (o + 9)729 holds. (In particular, if q' is odd, then q' = q\ if q' is even, then 

9' = 29.) 

There are three cases; 

1. 9 < r. Then either (a) q' is odd and q' < r or (b) q' is even and q' < 2r. 
Since a is not in VJls,r, then, by dehnition (10.5i, \S\/2y > 5gr/2qy, and so 
1^1 > ^of/q = 8r/q. In particular, |(5| > 8 . 

Thus, by Prop. |1 1.2. 3| 

| 5 ' 7 ,.(a,a;)| = \Srj^*,^ 4 ,{a,y)\ < ^^9^ Iv^li?/ < 9y,v{r)-\if\iy, (13.14) 


where we use the fact that g{r) is a non-increasing function (Lemma 11.2.4 1 . 
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2. r < q < jh. Then, by Prop. 11.2.3 and Lemma 11.2.4 


|S'^,(a,a;)| = 2/)l < 1 ) g ) • \q}\iy 

< 5y.vW • \T\iy- 


(13.15) 


3. q > y^^^(6. Again by Prop. 


11.2.3 


= |5'^2*«0(a>2/)l < { k ) + (13.16) 


where h{x) is as in (11.15 i. (Of course, C^^^{K), as in ( 13.12| i, is equal to 
where is as in (11.21 1 .) We set K = (logt/)/2. Since 

y = xj K> 10^®, it follows that y/K = 2j// log y > 3.47 • 10^^ > 2.16 • 10^°. 


Let 


n = 


9{r) = 

{9yA0 


if r < ri, 
(ri) if r > ri. 


By Lemma 11.2.4 for r > 670, g{r) is a non-increasing function and g{r) > gy^^{r). 
Moreover, by Lemma 11.2.5 gy^^[ri) > h{2y/ log y), where h is as in (|1 L15[), and so 


g{r) > h{2y/ logy) for all r > tq > 670. Thus, we have shown that 


\SnAy,o:)\ < 9{r) + C'v’.s 


logy 


Iv^liy 


(13.17) 


for all a € (M/Z) \ 

We first need to undertake the fairly dull task of getting non-prime or small n out 
of the sum defining 5^^ (a, x). Write 

5'i,,,+ (a,a;) = ^ (logp)e(ap)y+(p/a;), 

P>y/x 

*^2,77+ E A{n)e{an)g+{n/x) + ^ A{n)e{an)g+{n/x). 

n non-prime n<y/x 

n>y/x 


By the triangle inequality (with weights {a, x) |), 


'(R/Z)\OTs,r-o 


\Srj, {a, a;)| 15^^ (a, x)\‘^da 


< ,/7 \Sy,{a,x)\\Sj^r,+ {a,x)\^da. 

j = l V (®/^)\®^ 8,>-0 
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Clearly, 


'(®/Z)\OTs.. 


(a, a;)| 152 ,,,+ (a, x)f da 


< max |5 ,,„(q;,x)| • / \S2,ri+{ct, x)\'^da 

Jv./’L 


< ^ A(n) 7 ?*(n/x) • I ^ A{nf'q+{n/xf+ ^ K{nfr]+{n/xf 

n<y/x 


n—l 


\. n non-prime 


Let ? 7 +(z) = supo,^ t?+(i)- Since 77 +(t) tends to 0 as t —)• 00 , so does 77+. By IIRS621 
Thm. 13], partial summation and integration by parts, 

^ A{nfr]+{n/xf < ^ A{nfrp;{n/xf 


n non-pnme 


n non-pnme 


( 


< - 


< - 


\ 


E 


n<t 

n non-prime 


{rj+^it/x))' dt 




{\ogt) ■ l.A262Vi (rj+^^t/x))' dt 


< 0.7131 


loge^i _2 f ^ 


Vt 


77+ - u< 


= 0.7131 


2 + logte_2, \ 7 \ r- 

Ji/ - Vt - 


while, by llRS62l Thm. 12], 


E A(7r)277+(7z/x)2 < ||77+|^(logx) E 


n<y/x 


n<y/x 


< 0.51942|77+|^ • v^logx. 


This shows that 


p 00 

/ 15,,. (a, x) 11 52 .,,+(a, x) pda < A{n)ri^{n / x) ■ E = 5,,,.(0,x) • E, 

d(R/Z)\OTts,T.o 1 


where E is as in ( |13.1 l| l. 
It remains to bound 


/(B/Z)\OT8,: 


15,,. (a, x)||5i,,,+ (a, x)pda. 


(13.18) 


We wish to apply Prop. 13.1.2 Corollary 12.2.5 gives us an input of type (13.4l; we 
have just derived a bound ( 13.17|l that provides an input of type (|13.5|l. More precisely. 
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by ( |12.42 1 , (13.4 1 holds with 


r log(r-+l)+c+ 

H{r) = < logv^+c 

I 1 if r > ri, 

where c+ = 2.0532 > log 2 + 1.36 and c~ = 0.6394 < log(l/-\/2 ■ 8) + log 2 + 
1.3325822. (We can apply Corollary 12.2.5 because 2(ri + 1) = (3/4)a;'^/^® + 2 < 
(2 -^/x 716)° ® for X > 10^® (or even for x > 100000).) Since ri = and 

X > 10^5 • XT, 

1„„ H(.) - li„ H(r) = I - + 


< 1 _ (^11^ + log|+c+-Alog>,-Ac- 

“ V 1/2 \og^/x + c“ 


^ 7 —2.14938 + log >r 

“ 15 log a: + 2c“ 


We also have (|13.5[) with 


g[r) + ( 7^,3 


logy 


\iy 


(13.19) 


instead of g{r) (by (13.17 1 ). Here (13.19i is a non-increasing function of r because 


g{r) is, as we already checked. Hence, Prop. 13.1.2 gives us that (13.18 i is at most 


y(ro)-(i7(ro)-/o) + (l-/o)-C^.3 


logy 


1 


nri 


gir) 


dr 


7 —2.14938 + ^ log >r 


log ^/x + c~ Jrg r + 1 \ 15 log X + 2c~ 

times Iv^liy • J2p>^(}ogp)^gl{p/x), where 


If] — 


Y.p>v^(.^ogpypl{n/x) 

By the triangle inequality. 


|i5'i^^_^(a,x)p da. 


giri) 

(13.20) 

(13.21) 


\ \Si,p_^_{a,x)\'^ da = , \Sp^{a,x) - S2,p+{a,x)\'^ da 

V g^S.r-o V 

> \ [ \Sp+{a,x)\'^ da-. [ \S2,p+{a,x)\'^ da 

V “'*^8.-0 V 

>\ \Sri^{a,x)\'^ da-J \S2,p^{a,x)\'^ da. 

y Jms.r-o y 7 r/z 
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As we already showed. 



a, a;)p da 


Hn)^V+iri/xf < E. 

n non-prime 
or n < y/x 


Thus, 


Io-S>{VJ-VEf, 


and so we are done. 


□ 


We now should estimate the integral in (13.131. It is easy to see that 


fOO 1 2 

7 dr = 


'ro 


r3/2 


1/2 ’ 


log r log ero 
— 7 ^ dr = -, 


1 1 
-ndr = —, 


'ro 


fo 


'ro 


ro 


-dr = log—, 
r ro 


logr 21oge^ro 


log2r , 21og2e^ro 
-dr = 


r3/2 


(log2r)2 2P2(log2ro) 


r3/2 


-dr = 


(log2r)3_, 2P3(log2ro) 




r3/2 


-dr = 


1/2 


where 


P2(t) = t^+4t + 8, Psit) =t^ + + 24i + 48. 

We also have 


dr 


Jro r^ log r 

where Ei is the exponential integral 


= -El (log To) 


(13.22) 

(13.23) 

(13.24) 


Ei(^) 



We must also estimate the integrals 



j.3/2 


-dr, 




F (r) log r 


dr, 



F{r) 

j.3/2 


dr, (13.25) 


Clearly, F(r) — ei" log logr = 2.50637/log logr is decreasing on r. Hence, for 
r > 10®, 

r(r) < log log r + c-^, 

where c~f = 1.025742. Let F{t) = logf + c-^. Then F"{t) = < 0. Hence 


F"(t) [F'(t)f 

df^ 2^fF[t) 4(F(t))3/2 
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for all f > 0. In other words, y/F{t) is convex-down, and so we can bound \/F{t) 
from above by ^F{tQ) + \/~F (to) ■ {t — tg), for any t > tg > 0. Hence, for r > vq > 
10 ®, 

Vr{r) < y^F{logr) < A/i^(log Tq) -f ^}^\ t=logro ■ log — 


= \/-F(log ro) -f 


dt 

log- 

O ro 


ro 


A/J^(log ro) 2 log To 


Thus, by ( [TT2^ , 


'ro 


< o/Filogro) ^2 - 


+ 


1 

^(logro)^ 
log e^ro 


v/-F(log ro) log ro 
2 ^f(logro) / 


(13.26) 


1 + 


V -F(logro)logro 

The other integrals in ( 13.25|l are easier. Just as in ( |13.26| l, we extend the range of 
integration to [rg, oo]. Using (|13.22i and (13.24i, we obtain 


r Fjr) 
Jro r^ 

r(r) log r 


dr < 


dr < 


U(logr) 


dr = 


/loglog ro 


V ^0 
(1 + logrp) log log rp + 1 
ro 


+ E,ilogro)]+^ 


ro 


+ El (log ro) 


log ero 
ro 


By nOLBClOl (6.8.2)], 


1 


r(logr + 1) 

(The second inequality is obvious.) Hence 


< £^i(logr) < 


r log r 


F 


F (r) ^ eT'(log logrp + l/logrp) + c.^ 

■0 r2 “ rp 

(r) log r , 

-dr < 


'ro 


ro 


Finally, 


r{r) 

J. 3 I 2 


< 


2 log log rp 

yjrd 


< ( ^’(logro) + 


2Ei 

2^ 


logrp 


' log erp. 


2 c.y 


(13.27) 


It is time to estimate 


logrp / ■ 
i? 2 , 2 r log2r^r(r) 


r3/2 


dr, 


(13.28) 























































13.2. THE MINOR-ARC TOTAL 


255 


where z = y or z = y/{{\ogy)/2) (and y = xjyc, as before), and where is as 
defined in ( 11.13|l. By Cauchy-Schwarz, ( 13.28|l is at most 


/ n (i?.,2rlog2r)2 

V Jr, r3/2 



(13.29) 


We have already bounded the second integral. Let us look at the first one. We can write 
= 0.27125i?° t + 0.41415, where 


Clearly, 


K.t = log 1 + 


log At 

2 log 

^ 2.004t , 


^°z,e04 


log 



t/2 


log 


3621/3 

2.004 



Now, for f{t) = log(c + at/{b — t)) and t G [0, b). 


(13.30) 


fit) 


ab 

{c+^)ib-t)^’ 


nt) = “ 


ab{{a — 2c)(5 — 2t) — 2ct) 

{c+^J)\b-t)^ 


In our case, a = 1/2, c = 1 and b — log 362^/3 _ log(2.004) > 0. Hence, for t < b, 
-ab{{a - 2c){b - 2t) - 2ct) = ^ ^2f + ^(6 - 2f)^ = ^ Qfe > 0 , 


and so f"{t) > 0. In other words, t ^ R° ^ is convex-up for t < b, i.e., for 
e*/4 < 90^/3/2.004. It is easy to check that, since we are assuming y > 10 ^®^ 


= <-L ( 

16^ 2.004 V logy/ - 2.004’ 


We conclude that r R°z,2r is convex-up on logSr for r < ri, and hence so is 
r —> Rz^r, and so, in turn, is r —Thus, for r G [ro, ri]. 


p 2 

-^2,2r 


— ^z,2ro 


log/^lA , n 2 
logri/ro 


log r/rp 
logri/ro' 


(13.31) 
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Therefore, by (13.221, 


{Rz.2r log 2 r)" 


'»’o 


< 


r-3/2 


dr 


R 


logri/r 


Rt 


log r/ro 


+ 


‘^R’z,2ro 

log ^ 

^ ro 

^Rhr, 

log — 

^ ro 


^’^^°\ogri/ro ^■^’'Mogri/ro 
P2(log2ro) P2(log2ri) 


(log 2 r)^ 


dr 

j,3/2 


Vn 

P3(log2ro) P3(log2ri) 


1 o -P3(log2ro) P3(log2ri) 

log 2 ri-—-^-—- 

y/ri 

P2(log2ro) P2(log2ri) 


= 2 P: 


z,2ro 


VN) 

log 2 ro 

logs 




{Rz,2rx Rz,2ro^ ' ^ 


V'^0 

P 2 (log 2ro) 




P 2 (log 2ri) 


log 20 


+ 2 - 


Rl,2ri - Rl,2ro f P3i^Og2ro) P 3 (log 2 ri) 


log^ 

o ro 


= 2R 


z,2rQ 
2 


P2(log2ro) P2(log2ri)' 






+ 2 - 


-Rz, 2 ri - ^ 2 . 2 ro f P 2 (log 2 ro) P 3 (log 2 ri) - (log 2 ro)P 2 (log 2 ri) 


log: 




SIT 


(13.32) 

where P 2 (t) and P^{t) are as in (13.231, and P^ {t) = P3{t) — tP2{t) = 2f^ + 16f + 48. 


Putting all terms together, we conclude that 



g{r) 


dr < fo{ro,y) + /iSo) +/ 2 (ro, 2 /), 


(13.33) 


where 


/o(?'0) y) — ^(1 Cip) \/lo,ro,rx,y + ^ 




T,ro 


tf ^ o o^;((iogy)/ 2 )^^®, p 
/2(ro, y) = 3.36- ^ -log ; 


+ 


r(logro)logroy 

logero + 11.07 ) Jrg + 13.66 log eo + 37.55 


(13.34) 


ro 
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where F{t) = logt + c^, = 1.025742, y = x/>c (as usual), 


^0,ro,ri ,2 


= R 


z,2ro 


P2(log2ro) P2(log2ri) 


p2 _ p2 

-^2,2ri '^2,2ro 

log ^ 

° ro 


P 2 (log2ro) P 3 (log 2 ri) - (log 2 ro)P 2 (log2ri) 


Jr = P(logr) + 


logr ’ 


h^r = -f^(logr) + 




2 e^ 

logr’ 


C. 


(^,2, 


log 




log 


logy 


(13.35) 

and Cip^ 2 .K is as in (11.20 1 . 

Let us reca pitulat e briefly. The term / 2 (ro,y) in (13.34 1 comes from t he term 
3.36a;“^/^® in (|ll.l2i. The term /i(ro, y) includes all other terms in ( 11.12 1 , except 


for Rx, 2 r log 2r'^JT~(r}/{V^). The contribution of that last term is (13.28 1 , divided 


by \f2. That, in turn, is at most (13.29 1 , divided by \f2. The first integral in (13.29 1 
was bounded in (13.32i; the second integral was bounded in ( 13.27|i. 
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Chapter 14 

Conclusion 


We now need to gather all results, using the smoothing functions 

V* = {V2 *M 


where ip{t) = r ]2 = m *m m and rji = 2 ■ /[_i/ 2 ,i/ 2 ], and 

V+ = 

where 

pOO 

hnit) = / h{ty~^)FHiy)' 

Jo 


h{t) = 


,dy 

10 y 

'f2(2-t)3e‘-iZ2 iftG[ 0 , 2 ], 


0 


otherwise, 


Fnit) = 


sin(glogy) 
TT log 2 / 


We studied 77 * and 77 + in Part|^ We saw 77 * in Thm. 13.2.1 (which actually works for 
general (p : [0, 00 ) —>■ [0, 00 ), as its statement says). We will set k soon. 

We fix a value for r, namely, r = 150000. Our results will have to be valid for any 
X > x+, where x+ is fixed. We set x+ = 4.9 • 10^®, since we want a result valid for 
N > 10^^, and, as was discussed in (11.1 1 , we will work with x+ slightly smaller than 
N/2. 


14.1 The £2 norm over the major arcs: explicit version 

We apply Lemma 10.3.1 with y = rj+ and 770 as in (|1 1.3[). Let us first work out the 


error terms defined in ( 10.27| i. Recall that Sq = 8 . By Thm. |7.l!4 


ETr,^,Sor /2 = inax |err^,;^.^((5,x)| 

\S\<Sor/2 


= 4.772 ■ 10 


-11 


251400 


< 1.1405 • 10"®, 


(14.1) 
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E., 




max y/(f\errrj^^x^{S,x)\ 

X mod q ’ 


(ir<T--gcd(ij,2) 

|( 5 |<gcd((j, 2 )( 5 or -/25 


< 1.3482 • IQ-^'^VSOOOOO- 

< 2.3992 • 10"®, 


1 . 617 - 10 - 1 ° 1 


72 


(499900 + 5273000001 


(14.2) 

where, in the latter case, we are using the fact that a stronger bound for g = 1 (namely, 
( |14.1| l) allows us to assume q> 2. 

We also need to bound a few norms: by the estimates in 9A.3|and 9A.51 (applied 
with H = 200), 


|77+|i < 1.062319, I 77 +I 2 < 0.800129 + - 0-800132 


7+L < 1 + 2.06440727 
By ( [TOl^ and ([T+T), 


l + ^logi7 
H 


(14.3) 


< 1.079955. 


|-5'r,+ (0,a:)| = |r?+(0) ■ x + O* (err,,^,^,,,(0,a:)) • a:| 

< (|ry+|i + ET^^^Sor/ 2 )x < 1.063a;. 

This is far from optimal, but it will do, since all we wish to do with this is to bound the 
tiny error term in ( 10.27| : 

Kr ^2 = (1 + 7300000)(log a:)2 -1.079955 

- (2 - 1.06232 + (1 + 7300000)(loga;)7.079955/a;) 

< 1259.06(loga;)2 < 9.71 - lO-^^a; 


for a: > a:+. By (14.1 1 , we also have 


5.195o?’ I ET^^^s^ 


\v+U 


ET 


v+, 

2 


< 0.075272 


and 


5or(log2e7) [E^^,r,6o + Kr, 2 /x) < 1.00393 - 10“ 
By ( [a: 2^ and (|A|^, 

0.8001287 < |7;o|2 < 0.8001288 


and 


, , 274.856893 „ . 

\v+ -Vo\ 2 < < 2.42942 - lO"®. 


(14.4) 


(14.5) 


We bound \r]o \i using the fact that (as we can tell by taking derivatives) 770 {t) 
increases from 0 at f = 0 to a maximum within [0,1 /2], and then decreases tor/o (1) = 
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—7, only to increase to a maximum within [3/2, 2] (equal to the maximum attained 
within [0,1/2]) and then decrease to 0 at t = 2: 


|? 7 i^^|i =2 max T]i^\t) - 2 max r]i^\t) 

tG[0,l/2] tG[3/2,2] 

= 4 max + 14 < 4 • 4.6255653 + 14 < 32.5023, 

tG [0,1/2] 


(14.6) 


where we compute the maximum by the bisection method with 30 iterations (using 
interval arithmetic, as always). 

We evaluate explicitly 


E 

q<r 
q odd 


//!M 

</(9) 


= 6.798779..., 


using, yet again, interval arithmetic. 

Looking at ( |10.29[ ) and ( |10.28| l, we conclude that 


Lv,So < 2 • 6.798779 • 0.8001322 < 8.70531, 

LrM > 2 • 6.798779 • 0.80012872 - ((logr + 1.7) • (3.888 • 10”® + 5.91 • 10 -^ 2 )) 
- (1.342 • 10"®) • (^0.64787+ > 8.70517. 


Lemma 10.3.1 thus gives us that 


/ \Sr^^[a,x)\^ da = (8.70524 + 0*(0.00007))x + 0*(0.075273)a; 

Jms,r-o 

= (8.7052 + 0*(0.0754))a; < 8.7806x. 


(14.7) 


14.2 The total major-arc contribution 

First of all, we must bound from below 


Go=n(i-5;/iy^)n(i 


1 

(p - 1 )= 


pjAT ^ pfAT 

The only prime that we know does not divide N is 2. Thus, we use the bound 




p>2 


, ^,,,> 1.3203236. 

(T- 1)2 ' 


(14.8) 


(14.9) 


The other main constant is which we defined in ( 10.37|i and already started 

to estimate in (|1L6[I: 


Cpo.p. = I70I2 / ri^ip)dp + 2.71\r]'J2-O 


{{2- N/x) + pfr)^{p)dp 


(14.10) 
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provided that N > 2x. Recall that 77 * = (772 *m p){^'t )7 where ip(t) = t^e 
Therefore, 


nN/x 


V*ip)dp= f {r) 2 *(p){}i:p)dp= f m^f (p (—) dp— 

Jo Ji/i Jo w 


\m\i\p\i 1 r 


1/4 

- — / V2{w) [ p{p)dpdw. 

^Jlj 4 JycNjxw 

By integration by parts and 0AS64I (7.1.13)], 

/ p{p)dp = 7 /e“^ / e“‘ df < (7/ + - ) 

Jy Jy/Ci \ yj 




Hence 


[ p{p)dp< [ p{p)dp < [2>c+e~ 
JxN/xw J2x \ J 


and so, since I 772 I 1 = 1 , 

rN/. 


[ ??*(p)dp>—-/ p2iw)dw-(2 + 

Jo ^ Jl/4 \ 


> 


1/4 

- (2 + 


1 




2 >f 2 


(14.11) 




Let us now focus on the second integral in (14.101. Write N/x = 2 + ci/xr. Then 
the integral equals 


^ 2 +ci/x 2 

/ {-Ci/>C +pfr]^{p)dp< ^ {u-Cif {P 2 *M p)(u) 

Jo ^ Jo 

= ^ / 772(77;) / ivw — ci)^p(v)dvdw 

Ji/4 Jo 

“ i i,4 - 2 . 2 c,«, + C^yi) <i«, 


du 


1 1/4, 

1 /49 /)r 9 


>r3 2 4^1 + -^1 I ■ 

It is thus best to choose ci = (9/4)/-\/27r = 0.89762 .... 

We must now estimate I 77 'H. We could do this directly by rigorous numerical 
integration, but we might as well do it the hard way (which is actually rather easy). By 
the dehnition ( |11.3| l of rjo, 

+ 1)P = - 18x^2 4443.10 _ 2843-8 3543.6 _ 240a;4 + 49a;2) 

(14.12) 

for X G [—1,1], and 77 '(a; + 1) =0 for x ^ [—1, Ij. Now, for any even integer k > 0, 


x^e = 2 / x'‘e ^‘<ix = 7 


k ^—x^ 


fc +1 
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where '){a,r) = is the incomplete gamma function. (We substitute 

f in the integral.) By IIAS641 (6.5.16), (6.5.22)], 7 ( 0 + 1,1) = 07 ( 0 ,1) — 1/e for 
all a > 0 , and 7 ( 1 / 2 , 1 ) = i/ 7 rerf(l), where 


erf(z) = — 


dt. 


Thus, starting from (14.12 1 , we see that 


15 


13 


11 


bol2 = 7 - 18-7 ^,1 + 111 -7 


- 284-7 f^,l) + 351- 7 (^, 1 ) - 210 - 7 ( 1,1 


+ 49-7( 2>1 


128 

We thus obtain 


64e 


(14.13) 


2.7II77: 


We conclude that 




({2- N/x) + pfr].,{p)dp 




(9/4)2 \ ^ 2.0002 
2 v^ / ~ ’ 


Ci7o,r;, + ^l + |l|^o|2 \V 0\2 


2>r2 J 


0002 


Setting 


>£■ = 49 


and using ( 14.4|i, we obtain 


> -d^^lilpoli- 0.000834). 

K 


(14.14) 


Here it is useful to note that \(p\i = and so, by (14.4i, |<p|i|t?o |2 = 0.80237 .... 
We have finally chosen x in terms of N: 


N 


N 


2+ £1 


9/4 1 
+ 27 ? 49 


= 0.495461 ...■ N. 


(14.15) 


Thus, we see that, since we are assuming N > 102^, we in fact have x > 4.95461... - 
102 ®, and so, in particular. 


a:>4.9-102®, ->10^^ 

X 


(14.16) 
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Let us continue with our determination of the major-arcs total. We should com¬ 
pute the quantities in (10.381. We already have bounds for Eri^^r,5o^ (see (14.7 1 ), 
Ln,r,So and Kr^ 2 - By Corollary |7.1.3| we have 


Erj.f.,r.8 — 


max v^l err^,,x* (<^i a;)| 

X mod q ’ 

g<r-gcd(g,2) 

|5|<gcd((3',2)(5or/2(j 


1 


< - ( 2.485 • 10"^® -f 


1 


Vw 


25 


(381500-f 76V300000n 


(14.17) 


< 


1.33805 • 10-® 


where the factor of k comes from the scaling in r;*(f) = (p 2 *m T’){xT) (which in 
effect divides x by xr). It remains only to bound the more harmless terms of type 2 


and LSr). 


Clearly, 2 < (l/x) A(n)(logn)77^(n/a:). Now, by Prop. 

00 

EA( n){\ogn)Tf{n/x) 


7.1.5 


n—1 


= ( 0.640206 + 0 * [2- 10"® -f 


366.91 


xlogx — 0.021095a: 


(14.18) 


< (0.640206 -f 0*(3 • 10"®))a:loga; - 0.021095a:. 


Thus, 

2 < 0.640209 log X. (14.19) 

We will proceed a little more crudely for Z ^2 2 : 

= -'^^^{n)'ql{n/x) < - VA(n)77*(n/x) • (? 7 *(n/x) log n) 

n n 

< (|77*|i -f |err^.,;^.j,(0,x)|) • {\r]^{t) • log+(x:f)|oo + |p*U log(x/>f)), 

(14.20) 

where log^(f) := max(0, log t). It is easy to see that 


|7*|oo = |??2 *M <p|oo < 


72 (i) 


< 4(log2)2 • - < 1.414, 
e 


(14.21) 


and, since log’*' is non-decreasing and 772 is supported on a subset of [0,1], 


I? 7 *(f) • log+(x:f)|oo = 1(72 *M p) ■ log"^ loo < I 72 *M (t ■ log'^jlc 


< 


72 (f) 


\ip ■ log"* 


< 1.921813 • 0.381157 < 0.732513 


where we bound \ip ■ log'*’ |oo by the bisection method with 25 iterations. We already 
know that 
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By Cor. 7.1.3 


|err^.,x.^(0,a;)| < 2.485 • 10"^^ + ^=(381500 + 76) < 1.20665 • lO'C 

vlO^® 

We conclude that 

Z^ 2^2 < (\/7r72/49+ 1.20665 •10-’^)(0.732513 + 1.4141og(x/49)) < 0.0362 log a;. 

(14.23) 

We have bounds for |77*|oo and |??+|oo- We can also bound 

I ^1 |(??2 *M </5) • ^ I?72|l • • i|oo / 33/2e-3/2 

\ll* ■ ^ OO = - < - < -• 

K K K 

We quote the estimate 

\r]+-t\^ = 1.064735 + 3.25312 • (1 + (4/7r) log200)/200 < 1.19073 (14.24) 

from ( |A.42| ). 

We can now bound LSrj{x, r) for rj = ??*, ??+: 


LSrj {x,r) = log r • max 77 ( — 

vKr ^ 

a>l 

( 

logp 
/log a; 


< (logr) • max 

p<r 


logx , l7-i|c 
imoo + 2 ^ 


< (logr) • max 


< 


p<r \logp 

(logr) (log a;) 


\v\c 


a>l 
>x 

Iv ' ^\oo 
1 - l/p 


p“/a 


and so 


LSri. < 


1.414 


- ' log2 
1.07996 


log a; + 2 


log 2 

49 


l^loo “t“ 2(log7’)|7] • t\ 00 1 


log r < 24.32 log a; + 0.57, 


(14.25) 


LSr,^ < 


log 2 


loga; + 2-1.19073 logr < 18.57 log a; + 28.39, 


where we are using the bound on |?7+ |oo in < 14.3 1 

We can now start to put together all terms in (10.361. Let eo = |? 7 + — 77o|2/|t?o|2- 
Then, by ( |14.5| l, 


Thus, 


eo bo |2 = l7+-7o|2< 2.42942-10-6. 

4.31004bo|i +0.0012lll|y 
2.82643bo|2(2 + eo) • eo +-^ 
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is at most 


2.82643 • 2.42942 • 10”® • (2 • 0.80013 + 2.42942 • 10"®) 


+ 


4.3101 • 0.80013^ + 0.0012 • 5?^ 


150000 


< 2.9387 • 10 


-5 


by ([T^, ( flAel l, and ( flA^ . 

Since ry* = (772 *m 772 is supported on [1/4,1], 


1^ |2 ^ \m *M t\1 ^ j_ 

>C >C 


00 / /*oo 


'0 \^0 


^2(t)v5(y)yl dw 


4/ 

3 

4>r 


1 - 


1 


nl 


, , ^ (w\ dt , 
it)T [j) -T^dw 


poo 2 


mt) 


t 




ij) 


w\ dw\ 


1 dt 


3 , , X , r ,2 , ,2 3 32 „ 3 _ 1.77082 

= ■^\V2i.t)/Vi\2 ■ \t\2 = Y~ ■ < -, 

4k 4k 3 8 >£■ 

where we go from the first to the second line by Cauchy-Schwarz. 

Recalling the bounds on £^r,,.r.<5 n ^nd we obtained in (14.2 1 and (14.17 1 , 

we conclude that the second line of ( 10.36 1 is at most times 

1.33805 • 10"® o 

- 8.7806 + 2.3922 • 10"® • 1.6812 

K 

■ (\/8.7806 + 1.6812 • 0.80014) 


1.77082 1.7316 • 10"® 

- < - , 


where we are using the bound < 8.7806 we obtained in (14.7 1 . (We are also using 
the bounds on norms in ( |14.3| ) and the value xr = 49.) 

By the bounds ( |14.19| l, ( |14.23| l and ( |14.25| l, we see that the third line of ( |10.36| l is 
at most 

2 • (0.640209 log x) ■ (24.32 log a; + 0.57) • a; 

+ 4a/O. 640209 log X • 0.0362 log a;(18.57 log x + 28.39)x < 43(log x)‘^x, 

where we use the assumption x > x+ = 4.9 • 10^® (though a much weaker assumption 
would suffice). 


Using the assumption x > x+ again, together with (14.22 1 and the bounds we have 
just proven, we conclude that, for r = 150000, the integral over the major arcs 



{a, x^Sri, {a, x)e{—Na)da 
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IS 


Co ■ Cr,o,r,,x^ +0* ( 2.9387- 10 "® • + HSiiLiL_a;2 43(loga;)\ 


— C'o ■ C!rjQ,r],x‘^ + O* 


3.85628 • 10-5 • x'^ 


= Co ■ + 0*(7.86996 • IQ-V), 

(14.26) 

where C'o and Crjo^n, are as in (10.37 1 . Notice that CoCr]o,T^,x'^ is the expected asymp¬ 
totic for the integr al ove r all of I 


Moreover, by (14.9 1 , (14.14i and (14.4l, as well as \ip\i = 

') 


Co ■ Cno^n. > 1.3203236 


i|77o|i 0.000834\ 


Hence 


1.0594003 0.001102 1.058298 

> - > - . 

>£■ >r 49 

1.058259 , 


/ (a, x)‘^Srj, {a, x)e{—Na)da > 

Jma.r- 


(14.27) 


where, as usual, k = 49. This is our total major-arc bound. 


14.3 The minor-arc total: explicit version 


We need to estimate the quantities E, S, T, J, M in Theorem 13.2.1 Let us start by 
bounding the constants in (13.12 1 . The constants j, j = 0,1,2, will appear only 
in the minor term E, and so crude bounds on them will do. 

1.19073\ 


By (|14.3|l and (|14.24|l. 


sup 77 +(r) < min [ 1.07996, 

r>t 


t J 


for all f > 0. Thus, 


Cty+.o = 0.7131 y ^supr 7 +(r)^ dt 


< 0.7131 


1.07996^ 
3 Vi 


dt 


1.190732 

f5/2 


dt < 2.33744. 


Similarly, 


C^+.i = 0.7131 


< 0.7131 


logf 


1 Vi 

1.190732 log f 

L EC 


sup 77 +(r) ) dt 

r>t 


dt < 0 . 44937 . 


























268 


CHAPTER 14. CONCLUSION 


Immediately from (14.31, 


Cr,+ ,2 = 0.51942|77+|^ < 0.60581. 


We get 

E < ((2.33744 + 0.60581) logx + (2 • 2.33744 + 0.44937)) • 
< (2.94325 log a: + 5.12426) • < 8.4029 • lO'^^ • x, 


(14.28) 


where E is defined as in ( 13.1 1[), an d where we are using the assumption a; > = 

4.9 • 10^®. Using (14.17 1 and (14.22 1 , we see that 

^r,.(0,a:) = (|77*|i +O*(i;r^.,o))^= (^^72 + 0*(1.33805- 10-®)) 


Hence 


^^.(O,^) • E < 1.05315 • 10"^^ • —. 


We can bound 

S < ^ A(n)(logn)? 7 ^(n/a;) < 0.640209a; log a; — 0.021095a; 


(14.29) 


(14.30) 


by (14.18 1 . Let us now estimate T. Recall that Lp(fy = ‘ Since 

pu pu 3 

/ ip(t)dt = / t^e~* I'^dt < / t^dt = —, 
an an Jo 3 


we can bound 


Cv.3 ( log ^ ) = 


1.04488 f 


\A72 




0.2779 


((loga;/>r)/2)3' 


By ( |14.7[ ), we already know that J = (8.7052 + 0*(0.0754))a;. Hence 
{Vj - y/Ef = (v'(8.7052 + C)*(0.0754))a; - V8.4029 • IO -12 • xf 


> 8.6297a:, 


(14.31) 


and so 


T = C ^,3 Q log ^y{S-{Vj- VEf) 


Q n 0770 

< • (0.640209a; logX - 0.021095x - 8.6297x) 
(logx/>r)'^ 

< 0 17792^^^°^ - 2 40405__ 

- ■ (logx/x)3 ■ (logx/x)3 

< 1.42336-^—- 13.69293 ^ 


ilogx/xY 


(logx/>r)3' 
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for K = 49. Since x/x > 10^®, this implies that 


T < 3.5776 • 10-'‘ • X. 


(14.32) 


It remains to estimate M. Let us first look at g{ro)', here g = gxi>c,ip^ where gy^^ is 
defined as in (11.19 1 and ^(f) = as usual. WrA&y = xjx. We must estimate 

the constant Cy,^ 2 ,K defined in (11.21 1 ; 


a 


(p,2,K 


h/K 

/•i 


ip{w) logw dw < — / (p{w)\ogw dw 

Jo 


< — we 


2 - w '^12 


logw dw < 0.093426, 


where again we use VNODE-LP for rigorous numerical integration. Since \Lp\i = 
and K = (log y)/2, this implies that 


Cy,2,K/|y|i ^ 0.07455 
logiL “logi^ 


(14.33) 


and so 


_ 0.07455 / 

- log i2|j/ + I ^" 


0.07455 \ 


Ry.t- 




(14.34) 


Let t = 2ro = 300000; we recall that K = (log?/)/2. Recall from (14.16l that 
y = x/x > 10^^; thus, yjK > 3.47435 • 10^^ and log((log y)/2) > 3.35976. Going 
back to the dehnition of in (1 L13|l, we see that 


Ry,.2ro < 0.27125 log (1 + ) + 0.41415 < 0.58341, 

V 2 log 2 . 004 . 2.150000 / 

(14.35) 

Ry,K. 2 ro < 0.27125 log (1 + ^ < 0.60295, 

\ 2 . 004 . 2.150000 / 

(14.36) 

and so 


Ry.K.cp,2rQ ^ 


0.07455, 

3.35976 


0.60295 + 1 - 


0.07455 \ 
3.35976 ) 


0.58341 < 0.58385. 


Using 


O 

r (r) = e'*' log log r + —-- < 5.42506, 

log log r 


■ see from PL13|l that 


L 2 ro = 5.42506 . ( ^ log 300000 + 7.82 ) + 13.66 log 300000 + 37.55 < 474.608. 
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Going back to (11.191, we sum up and obtain that 


giro) = 


(0.58385 • log300000 + 0.5)\/5.42506 + 2.5 


474.608 


150000 
< 0.041568. 


3.36 


V2 
logy 
2y 


150000 

1/6 


Using again the bound x > 4.9 • 10^®, we obtain 


log(150000 + 1) + ^ _ ^^2 

log ^/x + c- 
13 9716 

< -j-^-(0.640209a; log a; - 0.021095a:) - 8.6297a: 

“ iloga: +0.6394 ^ ® ^ 

11 7332t 

< 17.8895X - , - 8.6297a; 

i log a: + 0.6394 

< (17.8895 - 8.6297)a; < 9.2598a:, 


where = 2.0532 and c = 0.6394. Therefore, 


5(^0) 


/ log(150000+ 1) + C-* 
V log Val + c- 


■ s-{Vj-Ve)A < 


0.041568 • 9.2598a; 


< 0.38492a;. 


(14.37) 


This is one of the main terms. 

Let ri = ( 3 / 8 ) 2 /"^/^^, where, as usual, y = x/x and x = 49. Then 


Ry^ 2 r^ = 0.27125 log 1 + 


= 0.27125 log 1 + 


< 0.27125 log 1 + 


log (8 • 12/4/15) \ 

—^ -/- + 0.41415 


2 log__ , 

2 . 004 -/ 


_ j|logy+ log3 

2(5 - logy+ 2log 

_n_^ 


0.41415 (14.38) 


2 . 004-1 


+ 0.41415 < 0.71215. 
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Similarly, for K = (log y )/2 (as usual), 


\ogi8jy^\ 


Ry/K,2ri I ^ ^ 9(y/if)i/3 ) 

2.004 |y‘i/i= / 


0.41415 


= 0.27125 log 1 + 


= 0.27125 log 3 + 


^logy + logS 


iiilogj/- ^ log logy+ 21og 


9 . 21/3 

2.004-1 


3 loglogy-c 


A logy - floglogy+ 21og 


12 - 21/3 

2.004 


where c = 41og(12 • 21/3/2.004) - log 3. Let 

4 


/(/) = 


3 log t — C 


M- |logf + 21og 


12/21/3 ■ 

2.004 


+ 0.41415 

+ 0.41415, 
(14.39) 


The bisection method with 32 iterations shows that 

f{t) < 0.019562618 


(14.40) 


for 180 < t < 30000; since /(f) < 0 for 0 < f < 180 (by (4/3) logf — c < 0) and 
since, by c > 20/3, we have f{t) < (5/2)(logf)/t as soon as t > (logt)^ (and so, in 
particular, for t > 30000), we see that (|14.40|i is valid for all f > 0. Therefore, 


Ry/K.2ri £ 0.71392, 


(14.41) 


and so, by (14.34 1 , we conclude that 


Ry ,K ,ip.2ri — 


0.07455 

3.35976 


0.71392 + 1 - 


0.07455/ 
3.35976 / 


0.71215 < 0.71219. 


Since ri = (3/8)y^/33 and T(r) is increasing for r > 27, we know that 


r(ri) < log log y'^/^3 + 


2.50637 


= e'*' log log y + ' 


log log y4/i5 
2.50637 15 


log log y-log ^ 
for y > 10^3. Hence, (|11.13[) gives us that 


— e'/ log — log log y — 1.43644 

(14.42) 


/ 1 O q \ q 

L 2 ri < (e^' log logy — 1.43644) f — log -^y^ + 7.82 j + 13.66 log -^y^ + 37.55 
13 

< —logy log logy + 2.39776 logy + 12.2628 log logy + 23.7304 
15 

< (2.13522 logy + 18.118) log logy. 
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Moreover, again by (14.421, 


y/r(r^ < i/eT' log logy- 


1.43644 


2Ve^logIo^ 


and so, hy y > 10'^°, 


(0.71219log I?/A +0.5)v/r(ri) 

< (0.18992 logy + 0.29512) [ y /log log y — 


1.43644 


2y/e-/ log logy 


< 


0.195057^; m ()^- 0-^9505 M. 

^ 2VeMoglogy 


< 0.26031 logyi/loglogy - 3.00147. 
Therefore, by ( 1 1.19| l, 

, , 0.26031 log yVlog log y + 2.5 - 3.00147 

9yAri) < -- 


(2.13522 logy + 18.118) log logy ^ 3.36((logy)/2)^/® 


O 


,1/6 


^ 0.30059 log y-s/log logy ^ 5.69392 logy log logy 

— 2 I 4 


0.57904 ^ 48.3147log logy , 2.994(logy)i/6 

2 I 4 I TTS 

yT5 yiE yVb 

^ 0.30059 log y-s/log logy 5.69392 logy log logy 1.30151(logy)^/® 

— 2 4“ 4 4“ 


,1/6 


< 


0.30915 log y\/log log y 


where we use y > 10^® and verify that the functions t >-)• (logf)^/®/f^/®“^/^®, t >->• 
yi/)^l/^/^4/i5-2/i5 ajjdf !->• (loglogf)/f^/^®“^/^® are decreasing for t > y (just by 
taking derivatives). 

Since k = 49, one of the terms in ( 13.13[) simplifies easily: 


7 -2.14938+^ log >r ^ 7 

15 logx + 2c“ ~ 15 


By (14.30 1 and y = x/k = x/49, we conclude that 

7 ^ 7 0.30915 logyVEfEIj (0.640209log, - 0.021095)x 


15 


< 


15 

0.14427 log y Vlog log y 


(0.640209logy + 2.4705)a: < 0.30517x, 


(14.43) 
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where we are using the fact that y (log 2 /)^-yiogis decreasing for y > 
10 ^® (because y (log is decreasing for y > and 10^® > 

It remains only to bound 


2S 


9{r) 


log X + 2c~ 


dr 


in the expression ( |13.13| ) for M. We will use the bound on the integral given in ( |13.33| ). 
The easiest term to bound there is /i(r’o), defined in (13.34 1 , since it depends only on 
rg: for rg = 150000, 

/i(rg) = 0.0169073... . 


It is also not hard to bound / 2 (rg, x), also defined in (13.34 1 ; 
f 2 [ro,y) = 3.36--log 


< 3.36 


(log y) 


(.1/6 

1/6 


ro 


log y + 0.05699 — log rg I , 


(22/)i/6 1^15 

where we recall again that x = xy = 4Qy. Thus, since rg = 150000 and y > 10^5, 

/ 2 (rg, 2 /)< 0.001399. 


Let us now look at the terms Ii r, Cip in (13.35 i. We already saw in (14.331 that 
C^,2I\t\i / 0.07455 


C-tn — 


< 


\ogK -loglttp- 


< 0.02219. 


Since Fit) = e'*' logf + Cg, with Cg = 1.025742, 


h,To = ■F'(logrg) + 


2e^ 

logrg 


= 5.73826... 


(14.44) 


It thus remains only to estimate Io,ro,ri,z for z = y and z = y/K, where K = 
(logy)/2. 

We will first give estimates for y large. Omitting negative terms from (13.35 i, we 
easily get the following general bound, crude but useful enough: 


.^0,ro,i-i,z ^ ^z,2ro 


P 2 (log 2 rg) ^z, 2 ri - 0-414152 P 2 (log2rg) 




log; 




where P 2 {t) = + 4t 4- 8 and P 2 (t) = 2t^ + 16f + 48. By (14.38 1 and (14.41 1 , 

-Ry,2ri < 0.71215, Ry/K.,2r^ < 0.71392 

for y > 102®. Assume now that y > 10^®°. Then, since rg = 150000, 

log 4rg 


Ry 


< 0.27125 log 1 + 


9^(1015^ 

^ ^Og 2.004ro . 


0.41415 < 0.43086, 
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and, similarly, Ryjx^ro ^ 0.43113. Since 


0.43086^ • ^ 0.10426, 0.43113^ • -^^(lo^ro) ^ 0.10439, 






we obtain that 


(1 ^v)V^0,ro,ri,y + ,ri, 


< 0.97781 • Jo. 10426 + 


0.49214 


15 


0.02219J0.10439- 


logy-log 400000 (14.45) 

< 0.33239 


0.49584 


^ log y — log 400000 


fory>10i®°. 

For y between 10^® and 10^®°, we evaluate the left side of (14.45 1 directly, using 
the definition (13.35 i of /o,ro,ri ,2 instead, as well as the bound 

0.07455 

Cep < 


log*^ 


from (14.331. (It is clear from the second and third lines of (13.32 1 that /o,ro,ri,z is 
decreasing on z for tq, ri fixed, and so the upper bound for Cep does give the worst case.) 
The bisection method (applied to the interval [25,150] with 30 iterations, including 30 
initial iterations) gives us that 

(1 - Cep)^yIo,ro,rl,y + ^4,^0 .ri, ^ 0.4153461 (14.46) 


for 10^® < y < 10^^°. By (14.45 1 , (14.46 1 is also true for y > 10^^°. Hence 


/o(ro,y)< 0.4153461 

By ( |13.33| l, we conclude that 

r 9ir) 




:5.73827 < 0.071498. 


dr < 0.071498 + 0.016908 + 0.001399 < 0.089805. 


By ( [T4301 I, 
2S 


< 


2(0.640209a; log a: - 0.021095a;) 


log X + 2c“ log X + 2c~ 

where we recall that c~ = 0.6294 > 0. Hence 


< 2 • 0.640209a; = 1.280418a;, 


2S 


5(J 


log X + 2c~ 


dr < 0.114988a;. 


(14.47) 


'■To 
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Putting (14.37 1 , ( 14.43| l and ( 14.47| l together, we conclude that the quantity M 
defined in (13.13 1 is bounded by 


M < 0.38492a: + 0.30517a: + 0.114988a: < 0.80508x. 


(14.48) 


Gathering the terms from (14.291, (14.32 1 and ( 14.48| l, we see that Theorem 13.2.1 
states that the minor-arc total 


Zro= l'5'^.(a,a:)||S'^^(a,x)pdQ: 


is bounded by 





for ro = 150000, x > 4.9 • 10^®, where we use yet again the fact that \{p\i = yJ-Kj^. 
This is our total minor-arc bound. 


14.4 Conclusion: proof of main theorem 

As we have known from the start, 

A(ni)A(n2)A(n3)ry+(ni)ry+(n2)?7*(n3) 

= / Srj^{a,x)'^Srf,{a,x)e{—Na)da. 
Jmiz 

We have just shown that, assuming N > 10^^, N odd, 

/ Sr^^{a,x)'^ Sri,{a,x)e[—Na)da 

= / Sri^{a,x)‘^Srj,{a,x)e{—Na)da 


O* 


' {'R/Z)\'I0ls,rQ 


\S^_^^{a,x)\'^\S^,{a,x)\da 


> 1.058259— -fO* 1.00948— > 0.04877 


(14.50) 


for xg = 150000, where x = N/(2 + 9/(196-\/27r)), as in (14.15i. (We are using 
(14.27 1 and (14.491.) Recall that = 49 and 77 *(f) = (772 *m p){xt), where tf{t) = 
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It only remains to show that the contribution of terms with ni, n 2 or 713 non-prime 
to the sum in (14.50 1 is negligible. (Let us take out ni, n 2 , n .3 equal to 2 as well, since 
some prefer to state the ternary Goldbach conjecture as follows: every odd number > 9 
is the sum of three odd primes.) Clearly 


A(ni)A(n2)A(n3)?7+(ni)?7+(n2)?7*(n3) 

ni-\-n2-\-n2—N 
Til, ‘^2 or ri-3 even or non-prime 

< 3|77+|^|p*|oo A(ni)A(n2)A(n3) 

ni-\-n2-\-n3—N 
ni even or non-prime 

< 3|?7+|^|77*|oo-(logAf) 

ni < non-prime 712 

orni —2 


(14.51) 


By 
and 13], 

Hni) < 1.4262ViV -f log 2 < 1.4263^, 

rii < N non-prime 
or n 1 —2 

Mni) ^(" 2 ) = 1.4263VN ■ 1.03883Af < 1.48169iV^/^. 

ni < non-prime n2<A^ 

or ni —2 


14.31 and {U2i\, |ry+|oo < 1.079955 and |??*|oo < 1.414. By llRS^ Thms. 12 


Hence, the sum on the hrst line of ( |14.51| l is at most 

7.3306iV3/2 


Thus, for N > lO^^ odd 


E 


A(ni)A(n2)A(n3)?7+(ni)?7+(n2)?7*(n3) 


ni-\-n2-\-n3—N 
rii, 712 , odd primes 


> 0.04877— - 7.3306Af3/2 

> 0.00024433Af2 - 1.4412 • 10"“ • > 0.00024431V^ 


by >f = 49 and (14.151. Since 0.0002443iV^ > 0, this shows that every odd number 
N > 10^^ can be written as the sum of three odd primes. 

Since the ternary Goldbach conjecture has already been checked for all N < 8.875 • 
lO^o iHPlAl . we conclude that every odd number N > 7 can be written as the sum 
of three odd primes, and every odd number A^ > 5 can be written as the sum of three 
primes. The main result is hereby proven: the ternary Goldbach conjecture is true. 
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Appendix A 

Norms of smoothing functions 


Our aim here is to give bounds on the norms of some smoothing functions - and, in 
particular, on several norms of a smoothing function 774 . : [ 0 , 00 ) —?► K based on the 
Gaussian 

As before, we write 


h : t 1 -^ 


i2(2_i)3e‘-i/2 

0 


ift e [ 0 , 2 ], 
otherwise 


(A.l) 


We recall that we will work with an approximation 77 + to the function 770 : [0, 00 ) —> K 
defined by 


Vo{t) = h{t)r]^{t) = 


t^(2 - f)3e-(*-i)"/2 for t e [ 0 , 2 ], 
0 otherwise. 


The approximation 77 + is defined by 

r]+{t) = hH{t)te~^^, 


where 


Fnit) = 


sin(Tf logy) 


TTlogy 

pOO 

hnit) = {h *M FH){y) = / h{ty~^)FH{y) 

Jo 


y 


(A.2) 


(A.3) 


(A.4) 


and iJ is a positive constant to be set later. By (2.8 1 , Mhn = Mh- MFh- Now Fh is 
just a Dirichlet kernel under a change of variables; using this, we get that, for r real, 

'1 if|r|<£r, 

MFniir) = { 1/2 
0 
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if jrj < H, 
if jrj = FI, 
if jrj > H. 


(A.5) 
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Thus, 


r Mh^ir) if |r| < H, 
MhniiT) = < ^Mh{iT) if |r| = H, 
[o if \t\>H. 


(A. 6 ) 


As it turns out, h, rjo and Mh (and hence Mhn) are relatively easy to work with, 
whereas we can already see that hu and have more complicated definitions. Part 
of our work will consist in expressing norms of hn and 77 + in terms of norms of h, rjo 
and Mh. 


A.l The decay of a Mellin transform 


Now, consider any (j) : [0, 00 ) —>■ C that (a) has compact support (or fast decay), (b) 
satisfies = 0(1) for f —0+ and 0 < k < 3, and (c) is everywhere and 

quadruply differentiable outside a finite set of points. 

By definition, 

/■°° hr 

Mcjjis) = / (j)ix)x^ — . 

Jo a; 

Thus, by integration by parts, for 3fi(s) > —1 and s 7 ^ 0, 


M(j){s) 



(j){x)x‘^ 


dx 

X 


r°° dr r°° r® 

lim / d>(x)x ^— = — lim / M — d,^ 

t^0+ Jf. ^ ' X t^0+ Jt ^ ' s 


= lim 
t^o+ 



rix) 


a;®+^ 
s(s + 1) 


dx 


lim 

(->■0+ 


(j)^^\x) 




s(s + 1 )(5 2 ) 


dx 


/■oo .^*+3 

= lim / (j)^‘^\x)— - --——dx, 

Jt s(s + l)(s + 2)(s + 3) 

(A.7) 

where is understood in the sense of distributions at the finitely many points 

where it is not well-defined as a function. 

Let s = it, 4> = h. Let Ck = limj_j.Q+ {x)\x^~^dx for 0 < fc < 4. Then 

(|A.7|i gives us that 


Mh{it) < min 


Co A 
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L 3 


|f| ’ |f||f + z|’ |f||f + 7||t + 2 i| ’ \t\\t + i\\t + 2i\\t + 3i\ 


(A. 8 ) 


We must estimate the constants Cj , 0 < j < 4. 

Clearly, h{t)t ^ = 0(1) as t ^ 0+, h^{t) = 0(1) as f —0+ for all A: > 1, 
h(2) — h'(2) = /i"(2) = 0, and h{x), h'{x) and h''{x) are all continuous. The 
function h'" has a discontinuity at f = 2. As we said, we understand in the sense 
of distributions at f = 2 ; for example, lime_j.o J 2 -I = limc_j.o(L ’^^)(2 + e) — 

/r(3)(2-e)). 

Symbolic integration easily gives that 


Co= [ t{2 - tfe^-^/'^dt = 92e"^/2 - 126^/^ = 2.02055184... (A.9) 

Jo 








A.l. THE DECAY OE A MELEIN TRANSEORM 
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We will have to compute Ck, 1 < fc < 4, with some care, due to the absolute value 
involved in the definition. 

The function (a;^(2 — = ((a;^(2 — x)^)' + x‘^(2 — has the 

same zeros as Hi{x) = (a;^(2 — a;)^)' + a:^(2 — x)^, namely, —4, 0, 1 and 2. The sign 
of Hi{x) (and hence of h'{x)) is + within (0,1) and — within (1,2). Hence 


Cl = / \h'{x)\dx=\h[l)-h{G)\ + \h{2)-h{T)\=2h{l) = 2yfe. (A.IO) 

JO 

The situation with (a ::^(2 — x)^e“ i/^)" is similar: it has zeros at the roots of 
H 2 {x) = 0, where H 2 {x) — Hi{x) + H[{x) (and, in general, Hk+i{x) = Hk{x) + 
H'i^{x)). This time, we will prefer to find the roots numerically. It is enough to find 
(candidates for) the roots using any available too|^ and then check rigorously that the 
sign does change around the purported roots. In this way, we check that H 2 {x) = 0 has 
two roots 02 , 1 , 0 : 2,2 in the interval ( 0 , 2 ), another root at 2 , and two more roots outside 
[ 0 , 2 ]; moreover. 


0 : 2,1 = 0.48756597185712..., 
02,2 = 1.48777169309489..., 


where we verify the root using interval arithmetic. The sign of H 2 {x) (and hence of 
h''{x)) is first +, then —, then +. Write 02,0 = 0, 02,3 = 2. By integration by parts, 

fOO r(^2,2 /*2 

C 2 = j \h''{x)\xdx= / h''{x)xdx— / h''{x)xdx+ / h''{x)xdx 
Jo Jo Jot2,l Jci'2,2 

= \ h'- f h'{x) dx J 

i = l \ J 

2 

= 2 ^(-1)^+1 {h'{a2,j)a2,j - h{a2,j)) = 10.79195821037... . 

(A. 12) 

To compute C 3 , we proceed in the same way, finding two roots of H^{x) = 0 
(numerically) within the interval ( 0 , 2 ), viz.. 


03.1 = 1.04294565694978... 
03,2 = 1.80999654602916... 


The sign of H-i{x) on the interval [0, 2] is first —, then +, then —. Write 03^0 = 0, 
03 3 = 2. Proceeding as before - with the only difference that the integration by parts 


'Routine f ind_root in SAGE was used here. 
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is iterated once now - we obtain that 

/•OO ^ /*^3,i 

C'a = / \h'''{x)\x^dx = '^ 2 {—iy h'''{x)x^dx 

Jo JoL^J-l 

= ( ^"(a;)a;^la 3 ’ 5 -i - f h”{x) ■ 2x j dx 

3 = 1 V / 

= ^(-1)^ {h'\x)x^ - h'{x) ■ 2 x + 2 h{x)) 
i-1 
2 

= {a3,j)alj - 2h'{a3j)a3,j + 2/i(a3j)) 

3 = 1 

and so interval arithmetic gives us 

C 3 = 75.1295251672... 


(A. 13) 


(A. 14) 


The treatment of the integral in is very similar, at least as first. There are two 
roots of H^x) = 0 in the interval (0,2), namely, 

a4,i = 0.45839599852663 ... 
a4,2 = 1.54626346975533 ... 

The sign of H 4 {x) on the interval [0,2] is first —, +, then —. Using integration by parts 
as before, we obtain 



h^^\x) 


x'^dx 


rotA,i 

I0+ 


rOCA,2 

h^'^\x)x^dx+ / h^'^\x)x'^ dx — / h^‘^\x)x^dx 

0:4,1 04,1 

2 

= 2^(—1)^ ~ ih^^\a4j)a^ j + 6h'{a4j)a4j — 6h{a4j)^ 


- lim = 1152.69754862 ..., 

t — y2 


since lim(_j.o+ = 0 for 0 < fc < 3, limi_j. 2 - = 0 for 0 < fc < 2 and 

limt^ 2 -=-24e3/2. Now 



\h^‘^\x)x^\dx = 


lim |li(3)(2 + e)_/i(3)(2 


e)|.23 


2^ • 24e3/^ 


Hence 


C 4 



/rW(x) 


x^dx + 246^/2 . 23 = 2013.18185012 ... 


(A. 15) 
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283 


We finish by remarking that can write down Mh explicitly: 

Mh = -2) + 127(s+3, -2)+67(s+4, -2)+7(s+5, -2)), 

where 7 ( 3 , x) is the (lower) incomplete Gamma function 


(A. 16) 


7(s,x) = [ e H 
JO 


— t±s — l 


dt. 


We will, however, hnd it easier to deal with Mhhy means of the bound (A. 81 , in part 
because ( |A.16| l amounts to an invitation to numerical instability. 

For instance, it is easy to use (A .81 to give a bound for the £i-norm of Mh{it). 


Since C4/C3 > C3/C2 > C2/C1 > Cl/Co, 

pOO 

\Mh{it)\i = 2 / Mh{it)dt 
Jo 


<2(Co^+Ci 

\ '-'0 


=2 Cl + Cl log 


/■C 2 /C 1 

/ J +<^2 

ICilCo ^ 


rC3/C2 p. 

^+C3 


'C2/C1 


f2 


fCCC3 dt 


IC3/C2 


f3 


/C4/C3 


dt 

¥ 


C2C0 

Cl 


C2 


Ch_C2 

C2 C3 


Cl fCl 


Cl 


Cl Cl 


+ 


Ci cl 
3 ' C| 


and so 


\Mh{it)\i < 16.1939176. 


(A. 17) 


This bound is far from tight, but it will certainly be useful. 
Similarly, \{t + i)Mh{it)\i is at most two times 


Co / \t i\ dt Cl 


, C 2 

c, 


1 + 


t 


dt + C 2 


C 3 

dt 

C 2 t 
Cl 


Cl 


C 4 

f C 3 dt 

/c3 ¥ 


C 4 


f°° dt 

U ¥ 


= ^ ( + ^ + sinh ^ ^ + Cl ( V¥¥i + log 


let c 


C 4 ^ /^2 
0 ^0 


+ C 2 log 


-I Cl 

Co 

Cl C4 


'V¥¥i-i\\ ,gi 


+ C3|---|4||. 


and so 


\{t + i)Mh(it)\i < 27.8622803. 


(A. 18) 


A.2 The difference 77 + — rjo in £2 norm. 

We wish to estimate the distance in £2 norm between po and its approximation 77 +. This 
will be an easy affair, since, on the imaginary axis, the Mellin transform of 774 . is just a 
truncation of the Mellin transform of po ■ 
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By (A.2 1 and (A.3 1 , 


\ri+-rio\l= / hH{t)te 


dt 


, dt 


(A. 19) 


< ^nmxe * • J \hH{t) — h{t)\^ ^ . 

The maximum maxopi^e”*^ is (3/2)^/^e“^/^. Since the Mellin transform is an 


isometry (i.e., ( 2.61 holds), 


poo 7 i -| poo -| poo 

/ \hH{t) — h{t )\'^— = — / \MhH{it) — Mh{it)\‘^dt = — / \Mh{it)\'^dt. 

Jo t 27 r TT Jh 

(A.20) 


By (A.8l, 


Hence 


\Mh{it)\'^dt < 


IH 


IH 


oo ^2 /^2 

- 7HT 


r°° dt 


dt , Cl 


InW 


Using the bound (A.15 1 for C^, we conclude that 


\ri+ - t?o |2 < 


Ca / 3 1 ^ 274.856893 


y/7^\2ej 7 / 7/2 - H7/2 

It will also be useful to bound 

{v+{t)-Voit)f^ogtdt 


(A.21) 


(A.22) 


(A.23) 


This is at most 


( maxe ‘ f^| logfl 

\t>o 


\hH{t) - h{t)\^j. 


Now 


i>0 


maxe * f'’|logf| = max max e ' logf), max e ' f'^logf 


^te[o,i] 

= 0.14882234545... 


tG[l,5] 


where we find the maximum by the bisection method with 40 iterations (see |2.6| l. 
Hence, by \A.22) , 


{r]+it) — ??o(f))^| logf|df < 0.148822346 


Ttt 


27427.502 /165.61251 \ 

< - < 

- 7/7 - 


2 (A.24) 

7/7/2 ) ■ 
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A.3 Norms involving 77 + 


Let us now bound some ii- and £ 2 -norms involving rj+. Relatively crude bounds will 
suffice in most cases. 

First, by ( |A.23| l, 

b+|2 < |?7o|2 -b |??+ - ??o|2 < 0.800129 + 
b+b > |?7 o|2 - |»?+ - ?7o|2 > 0.800128 - 


(A.25) 


where we obtain 


H7/2 


|7?o|2 = VO.640205997... = 0.8001287... 


(A.26) 


by symbolic integration. 


Let us now bound \r]^ • log By isometry and (2.10 1 , 


l-ft^ log 1 ^= 2 - 


y+200 


\M{r]+ •log)(s)b(is = — 


k +ZOO 


\{Mr^+ns)\^ds. 


Now, {Mrj^y{1/2 + it) equals l/27r times the additive convolution of Mhniit) and 
{Mrj^y{1/2 + it), where = te~* Hence, by Young’s inequality, 

|(M?7+)'(1/2+ if)|2 < -^\MhH{it)\i\{Mr]<^y{1/2 + it)\2. 

Again by isometry and ( |2.10| l, 

\{Mr]^y{1/2 + it )\2 = V^lvo ■ log b- 

Hence, by ( |A.17| l, 

\V+ ■ logb < ■^\MhH{it)\i\T]<^ ■ logb < 2.5773421 • • logb- 

Since, by symbolic integration. 


!??❖ • log b < Y ^ 2)2 -b 272 + 7r2 + 8(7 - 2) log 2 - 87 ) ^^ 27 ) 

< 0.3220301, 

we get that 

I 7 +• logb < 0.8299818. (A.28) 

Let us bound \r]+{t)t°'\i for a G (—2, 00 ). By Cauchy-Schwarz and Plancherel, 

lhff{t)/'/il2 


\'q+{t)t‘^\i = hH{t)t^^'^e ‘ 

^ ^a+3/2^-t^/2 


< 


^rT+3/2g-tV2 


\hH{t)\^J = 


< 


^CT+3/2g-tV2 


J \Mh{ir)\'^dr = 


^<T+3/2g-tV2 

^f7+3/2g-tV2 


— [ |M7i(zr)|2dr 
271" J-H 


\h{t)/Vi\2. 


(A.29) 
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Since 


^.+3/2g-tV2|^ = ^ 

\hit)/Vi\2 = 


/31989 

585e3 

V 8e 

8 


< 1.5023459, 


we conclude that 


for a > —2. 


\il+{ty\i < 1.062319 • Vr(cr + 2) 


(A.30) 


A.4 Norms involving 


By one of the standard transformation rules (see (2.10i), the Mellin transform of -q'j^ 
equals —(s — 1) • Mr]+{s — 1). Since the Mellin transform is an isometry in the sense 
of ( [Z^ , 




1 *1-ss 


ds = 


C—k+ioO 


2Tri 


|s • M? 7 _|_(s)|^ ds. 


— ^—lOO 


Recall that ry+(f) = hH{t)ri(^{t), where ry^(f) = te~* Thus, by (2.9 1 , the func¬ 
tion Mr]+{—1/2 + it) equals l/27r times the (additive) convolution of Mbniit) and 
Mr]<^{—1/2 + it). Therefore, for s = —1/2 + it. 


/ ri 

Mh{ir)Mr]^{s — ir)dr 

-H 

3 

< — / |ir — l||M/i(jr)| • |s — *r||M77i;?(s —'jr)|(ir (A.31) 

271" J-H 


= ^(/.9)((), 


where /(f) = \it — l\\Mh{it)\ and g{t) = \ — 1/2 + ff|1/2 + it)\. (Since 
|(—1/2 + i(f — r)) + (1 + fr)| = 11/2 -I- if I = |s|, either | — 1/2 + z(f — r)| > |s|/3 or 
jl+zr| > 2|s|/3; hence |s—*r||fr—l| = | —l/2-(-i(f—r)||l+ir| > |s|/3.) By Young’s 
inequality (in a special case that follows from Cauchy-Schwarz), \f * g \2 < |/|i|p| 2 - 
By ( [AT8] l, 

I/ll = \{r + i)Mh{ir)\i < 27.8622803. 

Yet again by Plancherel, 

/ - j-l-ioo 

\s\^\Mg^{s)\^ds 

- ^ —200 

rh+i^ SttI 

= / \{M{g'^)){s)\^ds = 2nW^\l = —. 

J i —200 ^ 
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Hence 




13 /s 5 

< • 27.86228031/^ < 10.845789. 

“ V^ 27 r V 4 - 


(A.32) 


Let us now bound for a G (—1, c»). First of all. 


w+{t)ni = 


< 

< 


‘ +/i//(f)(l - f^)e * 


/i^(f)f‘"+^e * /2 ^ + |??+(f)t'" ^|i + |? 7 +(f)r+i|i. 
We can bound the last two terms by (A.30 1 . Much as in ( A.29|l, we note that 




a+\„-ei1 


< 


^tT+l/2„-tV2 


\h'H{t)Vi\2, 


and then see that 


|/iff (f)v^|2 = \j~J^ dt = J \M{Nfj){l + ir)\^dr 

= \{—ir)MhH{ir)\'^dr = J \{—ir)Mh{ir)\‘^dr 

= ]J^ J \Mih'){l + ir)\‘^dr < ^ ^ J \M{h'){l + ir)\‘^dr = \h'{t)Vi\ 2 , 


where we use the first rule in (|2.10|i twice. Since 


t+ 1/2 -tV2 


we conclude that 




/103983 

1899e3 

V 16e 

16 

■ + 3)) + Y 

/r(a + l) 

2 


= 2.6312226, 


< 2.922875v/r(cr + 1) + 1.062319v/r(cr + 3) 


for cr > —1. 


(A.33) 
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A.5 The ^oo-norm of 77+ 

Let us now get a bound for |77+|oo- Recall that ri+{t) = where ryo(f) = 

Clearly 


b+loo = \hH{t)vo{t)\oo < holoo + \{h{t) - hH{t))r](^{t)\c 
h{t)-hH(t) 


< bo 


t 


\v-0‘(J')Aoo- 


(A.34) 


Taking derivatives, we easily see that 

bo|oo '^o(l) 1; bo(^)^loo 2/e. 

It remains to bound | {h(t) — hHit))/t\ao- By ( |7^ , 

t \ sin w 


/,*(!)= /"/.((r')™'"'"*'’'*''’' 


The sine integral 


TTlogy y J-Hiog^ 
sinf 


oW j H 


7TW 


dw. (A.35) 


Si(a;) = f 
Jo 


t 


-dt 


is defined for all x; it tends to 7r/2 as cc —+(X) and to —7r/2 as a; ——oo (see IIAS641 
. We apply integration by parts to the se 

1 /■°° f d 

hH{t)-h{t) = — i^h 

J-H\ogi \dw 


(5.2.25)]). We apply integration by parts to the second integral in (A.35 1 , and obtain 

t 


l-Hlogi 

I r d 


u/H 


Si{w)dw — h{t) 


to 


t 


dw V 


J-H\og ^ 


lb,’' 


t 


u/H 


(Si(w) - I) dw 
(Si{w) 


+ T) dw. 


Now 


<' I 


u/H 


H 




u/H 


< 


t\h'\ 




Integration by parts easily yields the bounds | Si(a:) — 7r/2| < 2/x for a; > 0 and 
I Si(a;) + 7r/2| < 2/|a;| for x < 0; we also know that 0 < Si(x) < x < 7r/2 for 
X S [0,1] and —•7r/2 < x < Si(x) < 0 for x e [—1, 0]. Hence 




OO 2e-™/-f^ 


dw 


w 


where Ei is the exponential integral 

e"* 

£^ 1 ( 2 ;) = J —dt. 


























A.5. THE e^-NORM OFr]+ 


289 


By IIASMI (5.1.20)], 


0 < Ei{l/H) < 


log jH + 1) 

ol/H ‘ 


and, since log(i?+l) = logiJ+log(l + l/i/) < \ogH+l/H < (log H){1 + 1/H) < 
([ogH)e^/^ for i/ > e, we see that this gives us that Ei{l/H) < logH (again for 
H > e, as is the case). Hence 


Ihnit) - h{t)\ 


< \h'[ 


, _jL 4\ogH 

1-e « +-— 

TT H 


< \h'[ 


lE^logH 
^ H 


(A.36) 


and so, by (|A.34|i, 


h+loo < 1 + - 

e 


h{t) - hH{t) 


2, ,, 1 E-logH 

< l + -|/l loo ^ . 


By (A. 11 1 and interval arithmetic, we determine that 

\h'\^ = \h'{a2,2)\ < 2.805820379671, 


(A.37) 


where 02,2 is a root of h"{x) = 0 as in (A. 11 1 . We have proven 

l + |logi7 


Ip+Ioo < 1 + -•2.80582038- 


H 


1 + - log H 

< 1 + 2.06440727--- ^ - . (A.38) 

H 


We will need three other bounds of this kind, namely, for 77 +(f) logf, r]+(t)/t and 
We start as in (A.34i: 


(A.39) 


h+l0gf|oo < |t?ologf|oo + \{Ht) - hH{t))V0it)logt\^ 

< \Vo logfloo + \{h - hH{t))/t\oo\r]0{t)t^0gt\oo, 

|?7+(f)/f|oo < |7?o(f)/f|oo + \{h- hHit))/t\aa\voit)\oa 
\V+{t)t\<>c < |t?o(f)f|oo + \{h - 

By the bisection method with 30 iterations, implemented with interval arithmetic, 

|77o(f) logfjoo < 0.279491, logfjoo < 0.3811561. 

Hence, by ( |A.36| l and ( |A.37 1 , 

, , 1 + - log H 

\V+ logfloo < 0.279491 + 1.069456 • -- — . (A.40) 

H 

By the bisection method with 32 iterations, 

|??o(f)A|oo < 1.08754396. 

(We can also obtain this by solving { r]o(t)/ t)' = 0 sym bolically.) It is easy to show 
that A^loo = l/v/e- Hence, again by (A.36l and (A.37 1 , 


|?7+(f)Aloo < 1.08754396 + 1.70181609 


1 + f log 77 
77 


(A.41) 
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By the bisection method with 32 iterations, 

\rio{t)t\^ < 1.06473476. 

Taking derivatives, we see that |?7o(i)i^|oo = Hence, yet agai 

and ( |A37 ] i, 

, / X , 1 + - log H 

\v+{i)t\oo < 1-06473476 + 3.25312 - 

M 


by (A.36 1 


(A.42) 






Appendix B 

Norms of Fourier transforms 


B.l The Fourier transform of 7/3 


Our aim here is to give upper bounds on [772 loo > where 772 is as in 
considerably better than the trivial bound I 77 " loo < |77"|l. 

Lemma B.1.1. For every t G M, 

|4e(-t/4) - 4,e{-tl2) + e(-t)| < 7.87052. (B.l) 

We will describe an extremely simple, but rigorous, procedure to find the maxi¬ 
mum. Since |(?(i)P is (in fact smooth), there are several more efficient and equally 
rigourous algorithms - for starters, the bisection method with error bounded in terms 

ofKiffprioo. 


(3.4i. We will do 


Proof. Let 


For a < t < b. 


g{t) = 4e(-f/4) - 4e(-f/2) -f 


git) =g{a) + ^—“(5(6) -5(0)) -f ^{b - af ■ 0*( max \g”iv)\). 

0 — Qj o vG 


(B.2) 

(B.3) 


(This formula, in all likelihood well-known, is easy to derive. First, we can assume 
without loss of generality that a = 0, 6 = 1 and g{a) = g{h) = 0. Dividing by g 
by git), we see that we can also assume that g{f) is real (and in fact 1). We can also 
assume that g is real-valued, in that it will be enough to prove ( |B.3| l for the real-valued 
function fig, as this will give us the bound g{t) = ftg{f) < (1/8) max„ |(3fi(7)"(u)| < 
max„ \g''{v) \ that we wish for. Lastly, we can assume (by symmetry) that 0 < f < 1/2, 
and that g has a local maximum or minimum at t. Writing M = max„g[Q \g"(u)\, 
we then have; 
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as desired.) 


We obtain immediately from (B.31 that 

max \g{t)\ < max(|g(a)|, |g(&)|) + hb-af ■ max \g”{v)\. (B.4) 

te[a,b] 8 uG[a,h] 


For any v G 


l 5 "(t^)| < (^) • 4 + TT^ • 4 + (27r)2 = Qtt^ 


(B.5) 


(B. 6 ) 


Clearly g(t) depends only on t mod 47 r. Hence, by (B.4i and (B.5 i, to estimate 

max|p(f)| 

with an error of at most e, it is enough to subdivide [ 0 , 47 r] into intervals of length 
< \/ 8 e/ 97 r 2 each. We set e = 10“® and compute. □ 

Lemma B.1.2. Let g 2 ■ R’*' —>-]S.be as in {3.4 1. Then 

|^u< 31.521. 

This should be compared with |ry 2 li = 48. 

Proof. We can write 

= 4(4(5i/4(a;) - 45i/2{x) + Si{x)) + f{x), 
where Sxg is the point measure at xq of mass 1 (Dirac delta function) and 

{ 0 if a: < 1/4 or a; > 1, 

-Ax-^ ifl/4<x<l/2, 

4a:“^ if 1/2 < X < 1 . 


(B.7) 


Thus ri 2 {t) = Ag{t) + f{t), where g is as in (B.2i. It is easy to see that \f'\i = 
2maxx f{x) — 2mina; /(x) = 160. Therefore, 


fit) = f'it)/{2TTit) 


< 


\r\ 


80 


(B. 8 ) 


27r|f| 7r|f| 

Since 31.521 — 4 • 7.87052 = 0.03892, we conclude that ( |B. 6 | l follows from Lemma 


B.1.1 and (B .8 i for |f| > 655 > 80/(7r • 0.03892). 


It remains to check the range t G (—655, 655); since 4p(—f)+/(—f) is t he co mplex 
conjugate of Ag{t) + f{t), it suffices to consider t non-negative. We use (B.4i (with 
Ag + f instead of g) and obtain that, to estimate maxtgR \ Ag + f{t) \ with an error of at 

most e, it is enough to subdivide [0,655) into intervals of length < \/2e/|(45 + 7)"|oo 
each and check \ Ag + f{t)\ at the endpoints. Now, for every t S M, 
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By this and (B.5i, \{Ag + /)"|oo < 487r^. Thus, intervals of length (5i give an error 
term of size at most 2ATr^5\. We choose Si = 0.001 and obtain an error term less than 
0.000237 for this stage. 

To evaluate f(t) (and hence 4g(t) + f{t)) at a point, we integrate using Simpson’s 
rule on subdivisions of the intervals [1/4,1/2], [1/2,1] into 200 • max(l, [y/jflj) sub¬ 
intervals eachj^ The largest value of /(f) we find is 31.52065 ..., with an error term 
of at most 4.5 • 10“®. □ 


B .2 Bounds involving a logarithmic factor 

Our air 

y > 4. 


Our aim now is to give upper bounds on \g”y~^\oo, where ri(^y){t) = log(yf)?72(f) and 


Lemma B.2.1. Let 772 : 

y > 4. Then 


be as in {3.4\. Let r]i^y)(t) = \og{yt)r] 2 {t), where 

(B.9) 


\v[y)\i < (logy)|P2li- 

Proof. Recall that supp(?72) = (1/4,1). For t G (1/4,1/2), 

= (4bg(.()log4,)' = 1]?^ + ^ > 0, 


whereas, for t G (1/2,1), 

' M t 41ogyt 41ogf 

V(y)(t) = (-41og(yt) logt) =-^-— 


4 log yt^ 
t 


< 0 , 


where we are using the fact that y > 4. Hence 'rj[y){t) is increasing on (1/4,1/2) and 
decreasing on (1/2,1); it is also continuous at f = 1/2. Hence |'y(y)|i = 2|?7(y)(l/2)|. 
We are done by 

2 |y(y)(l/2)| = 21og| • P2(1/2) = log I • 81og2 < logy •81og2 = (logy)|?72|i- 

□ 

Lemma B.2.2. Let y > 4. Let g(t) = 4e(—f/4) — 4e(—f/2) + e(—f) and k{t) = 
2e(—f/4) — e{—tj2). Then, for every t G K, 


\g{t) ■ logy — k{t) ■ 41og2| < 7.87052logy. 


(B.IO) 


Proof By Lemma B. 1.1 |y(f)| < 7.87052. Since y > 4, k{t) ■ (4log 2)/logy < 6. 


For any complex numbers zi, Z 2 with jzij, |z2| < £, we can have |zi — Z2I > (■ only if 
I arg(zi/ 2 ; 2 )| > tt/S. It is easy to check that, for all t G [—2, 2], 


/ g(t) - logy ] 



^''^[41og2-k(t)J 


im) 


<0.7<-. 


(It is possible to bound maxima rigorously as in (|B.4|l.) Hence (|B.10|i holds. 


□ 


* As usual, the code uses interval arithmetic (f 2.6 . 
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Lemma B.2.3. Let 772 : 

7/ > 4. Then 


Proof. Clearly 


'R be as in {3.4\. Let = {\ogyt)rj 2 {t), where 

Wly)\oo < 31.521 • logy. (B.ll) 


2 1 

= r]'^{x){\ogy) + ( (logx)??^'(a:) + 


= ?72 (2;)(logy) + 4(loga;)(4(5i/4(a;) - 4<5i/2(a;) + 5i(a;)) + h{x), 


where 


{ 0 if a: < 1/4 or a: > 1, 

^(2-21og2a;) ifl/4<x<l/2, 

^(-2+ 2 log a;) ifl/2<x<l. 

(Here we are using the expression B for ?72 (2;).) Hence 

ily)ii) = (45(0 + /W)(logy) + (-16log2 • k{t) + h{t)), 


where k{t) = 2e{—tj4) — e{—tj2). Just as in the proof of Lemma B.1.2 


Again as before, this implies that ( |B.ll| l holds for 
1 


80 


160(1 + log 2) 

7r|f| 


(B.12) 


(B.13) 


\t\ > 


TT ■ 0.03892 


80+ 160(; + l°g 2))= 2252.51. 
(log 4) J 


Note also that it is enough to check ( |B.l 1[ ) for f > 0, by symmetry. Our remaining task 
is to prove (|B.l 1|) for 0 < t < 2252.21. 

Let / = [0.3, 2252.21] \ [3.25, 3.65]. For t € I, we will have 


arg 


4y(f) + f(t) 


/ TT 7r\ 

^ (“3’3) ■ 


(B.14) 


— 16 log 2 • k{t) + h{f) ^ 

(This is actually true for 0 < f < 0.3 as well, but we will use a different strategy in 
that range in order to better control error terms.) Consequently, by Lemma B.L2 and 
log y > log 4, 

< max(|4y(f) + 7(f)| • (logy), |161og2 • k{f)-h{t)\) 


< max(31.521(logy), |481og2 + 25|) = 31.521 logy. 


where we bound h{f) by (B.13 1 and by a nu merical computation of the maximum of 
|/i(f) I for 0 < f < 4 as in the proof of Lemma 


B.L2 


It remains to check (B.141. Here, as in the proof of Lemma B.2.2| the allowable 
error is relatively large (the expression on the left of (|B.14[) is actually contained in 
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(—1,1) for t G I). We decide to evaluate the argument in (B.14 1 at all t G 0.005Z n /, 
computing f{t) and h{t) by numerical integration (Simpson’s rule) with a subdivision 
of [—1/4,1] into 5000 intervals. Proceeding as in the proof of Lemma B.1.1 we see 
that the sampling induces an error of at most 


^0.005^ max((4|p"(u)| 

2 vGi 


\ify'{t)\) < < 0.00593 (B.15) 


in the evaluation of 4:g{t) + f{t), and an error of at most 

^0.005^ max((161og2 • \k"{v) \ + |(h)"(f)|) 

2 vei 

< ^ (16 log 2 • Ott^ + 247r^ • (2 — log 2 )) < 0.0121 

8 


in the evaluation of 161og2 • \k''{v)\ + |(h)"(<)|. 

Running the numerical evaluation just described for t G I, the estimates for the left 
side of ( |B.14[ ) at the sample points are at most 0.99134 in absolute value; the absolute 
values of the estimates for 4p(f) + f{t) are all at least 2.7783, and the absolute values 
of the estimates for | — 16log 2 • logfc(f) + h{t)\ are all at least 2.1166. Numerical 
integration by Simpson’s rule gives errors bounded by 0.17575 percent. Hence the 
absolute value of the left side of ( |B.14| l is at most 

0.99134 + arcsin ( + 0.0017575 ) + arcsin ( ^ + 0.0017575 ) 

Y 2.7783 J ^2.1166 J 

< 1.00271 < ^ 

o 

for t G I. 

Lastly, for t G [0, 0.3] U [3.25, 3.65], a numerical computation (samples at O.OOlZ; 
interpolation as in Lemma [B. 1.2 [ integrals computed by Simpson’s rule with a subdi¬ 
vision into 1000 intervals) gives 


max 

tG[0,0.3]U[3.25,3.65] 


l(4g(f) + /(f))l + 


— 16log2 • k{t) + h{t)\ 
log 4 


< 29.08, 


andsomaxtg[o,o.3]u[3.25,3.65] h(y)loo < 29.1 logy < 31.521 logy. □ 

An easy integral gives us that the function log -772 satisfies 


I log •772 |i = 2-log 4 


(B.17) 


The following function will appear only in a lower-order term; thus, an f 1 estimate will 
do. 


Lemma B.2.4. Let 772 : 


—y. 


. be as in {3.4 L Then 


[(log- 772 )" 1 1 = 96 log 2. 


(B.18) 
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Proof. The function log ■r]{f) is 0 for f ^ [1/4,1], is increasing and negative for t G 
(1/4,1/2) and is decreasing and positive for t G (1/2,1). Hence 

|(log-? 72 )"|oo =2 ^(log-pa)' - (log • 772 )' 

= 2(16 log 2 - (-32 log 2)) = 96 log 2. 

□ 



Appendix C 

Sums involving A and (f) 


C.l Sums over primes 


Here we treat some sums of the type where ip has compact support. 

Since the sums are over all integers (not just an arithmetic progression) and there is no 
phase e{an) involved, the treatment is relatively straightforward. 

The following is standard. 

Lemma C.1.1 (Explicit formula). Let ip : [l,oo) —tCbe continuous and piecewise 
with p" G fi; let it also be of compact support contained in [1, ckd). Then 


^A(n)(p(n) 



p{x)dx 

P 


(C.l) 


where p runs over the non-trivial zeros o/^(s). 


The non-trivial zeros of C(s) are, of course, those in the critical strip 0 < 5ft(s) < 1. 

Remark. Lemma C. 1.1 1 appears as exercise 5 in OIK041 §5.5]; the condition there 
that p be smooth can be relaxed, since already the weaker assumption that p" be in 
implies that the Mellin transform {Mp){a + it) decays quadratically on < as f —>■ oo, 
thereby guaranteeing that the sum 'Yh t){p) converges absolutely. 


Lemma C.1.2. Let x > 10. Let 772 be as in ( |77.7| ). Assume that all non-trivial zeros of 
(^(s) with |9(s)| < Tq lie on the critical line. 

Then 


n. ^ 


log^^ /9/4 6^\ 

To 2^ To J 


(C.2) 

In particular, with Tq = 3.061 • 10^*^ in the assumption, we have, for x > 2000, 


Y. = (1 + 0*{e))x + O*(0.135x’^/^), 


where e = 2.73 • 10 
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The assumption that all non-trivial zeros up to Tq = 3.061 • 10^° lie on the critical 
line was proven rigorously in flPlaall : higher values of Tq have been reached elsewhere 
( 0WedO3l . 000041 1. 


Proof. By Lemma C. 1.1 




where ‘p{u) = 772 (u/x) and p runs over all non-trivial zeros of C(s)- Since 772 is non¬ 
negative, 772 (f/a 7 )df = a;| 772 |i = x, while 



V2{t/x) 
- 1 ) 


dt = O* 


mif) 


tin — 1 / 100 ) 


dt 


= O* 


9.61114\ 

:e 2 ) ■ 


By ([ZTT), 


J2{M<p){p) ='^Mp2{p) ■ xP = p 

P P p ^ ^ ' 

= Siix)-2Siix/2) + Siix/4), 


where 

Sm{x) = ^ ^ . (C.3) 

p ^ 

Setting aside the contribution of all p with |5(p)| < Tq and all p with |3(p)| > Tq and 
5i(s) < 1 / 2 , and using the symmetry provided by the functional equation, we obtain 

|«5'm(a;)| < X ■ 3 

I 71 — \ \rn+l P ™+l 

|a(p)l>ro 

|K(p)|>l/2 

^ 1/2 ^ ^ ^ 

""" ipr +^^2 ■ ^ ipr+i' 

|a(p)l>To 


We bound the first sum by IIRos41l Lemma 17] and the second sum by IIRS03I Lemma 
2]. We obtain 


|S'm(a;)| < 


2rmTTff 




2.68 


Tr 


m+l 


1 , 

X log —-h K. 

2ii 


nl /2 


where m = 0.0463, K 2 = 0.00167 and kq = 0.0000744. 
Hence 


(C.4) 


'^{Mp){p) ■ xP 


< 



2.68 \ 9a;, cTq 

If 
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For To = 3.061 • 10^° and x > 2000, we obtain 

^ A(n)772 (-) = (1 + 0*{e))x + 0*{0.135x^^^), 

n 

where e = 2.73 • 10-i°. □ 


Corollary C.1.3. Let r ]2 be as in {11.7). Assume that all non-trivial zeros of (^{s) with 
|Sr(s)| < Tq, Tq = 3.061 • 10^*^, lie on the critical line. Then, for all x>l. 


A{n)r ]2 ^ < min ^(1 + e)x + 0.2x^^^, 1.04488x^ , (C.5) 


where e = 2.73 • 10 


Proof. Immediate from Lemma C.1.2| for x > 2000. For x < 2000, we use computa¬ 
tion as follows. Since jp^loo = 16 and X]x/ 4 <n<a; ^ > 0, computing 

J2n<x A{n)r] 2 {n/x) only for x G (1/1000)Z n [0, 2000] results in an inaccuracy of at 
most (16 • 0.0005/0.9995)a: < 0.00801a;. This resolves the matter at all points outside 
(205, 207) (for the first estimate) or outside (9.5,10.5) and (13.5,14.5) (for the second 
estimate). In those intervals, the prime powers n involved do not change (since whether 
xjA < n < X depends only on n and [a;]), and thus we can find the maximum of the 
sum in (C.51 just by taking derivatives. □ 


C.2 Sums involving 0 

We need estimates for several sums involving (j){q) in the denominator. 

The easiest are convergent sums, such as (q)/{'P{q)q)■ We can express this 

as np(l + 1 /(f(f ~ I)))- This is a convergent product, and the main task is to bound 
a tail: for r an integer, 

iognfi+ , ^ ^ t ^ it = -- (C.6) 

p(f-1)/ ^ 

A quick computatioijiJnow suffices to give 

2.591461 < y < 2.591463 (C.7) 

„ nm 


and so 


1.295730 < y 

q odd 




< 1.295732, 


(C.8) 


since the expression bounded in (C .81 is exactly half of that bounded in (C.7 1 . 


^Using D. Platt’s integer arithmetic package. 
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Again using (C. 61 , we get that 


2.826419 < 


(j){q)^ 


< 2.826421. 


(C.9) 


In what follows, we will use values for convergent sums obtained in much the same 
way - an easy tail bound followed by a computation. 

By IIRam95l Lemma 3.4], 


=^^S'^ + ^E + 0*i7.284r 

^ (l>W 


q^r 




(CIO) 


where 


q^r 
q odd 


ce 


= i+Y 


logp 

p{p - 1) 


= 1.332582275 + O*(10-73) 


by IIRS621 (2.11)]. As we already said in (fTzTSjl, this, supplemented by a computation 
for r < 4 • 10^, gives 


logr+ 1.312 < Y 


q<r 

for r > 182. In the same way, we get that 




< logr + 1.354 


^ logr + 0.83 < Y^ ^ + 0.85 


q<r 
q odd 




(C.ll) 


for r > 195. (The numerical verification here goes up to 1.38 • 10®; for r > 3.18 • 10®, 
use lCrT] ) 

Clearly 

y- iTM = y (C 12) 

hr hr 

q even q odd 

We wish to obtain bounds for the sums 

q odd q even 

where N G and r > 1. To do this, it will be helpful to express some of the 
quantities within these sums as convolutions For q square free and i > 1, 


^ ^ sr /j(^) 

(l){qy ^ a 

' ^ ab—q 


(C.13) 


^The author would like to thank O. Ramare for teaching him this technique. 
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where fj is the multiplicative function defined by 


^ {p^iyp ’ (/) = 0 for * > 2. 

We will also find the following estimate useful. 

Lemma C.2.1. Let j > 2 be an integer and A a positive real. Let m> 1 be an integer. 
Then 


E 

a>A 


T\a) ^ C(j)/C( 2 j) 






^-n 

p|m 


(C.14) 


It is useful to note that C(2)/C(4) = IS/tt^ = 1.519817... and C(3)/C(6) = 
1.181564.... 

Proof. The right side of ( C.14| i decreases as A increases, while the left side depends 
only on \Al\. Hence, it is enough to prove (C.14i when A is an integer. 

For A = 1, (|C.14|l is an equality. Let 


C = 


CO') 

C( 2 j) 


n > 


p\m 


Let A> 2. Since 


and 


aj 


a> A 


a<A 




E 

a 

= E 

a<A 




< 


E 


P^ia) ^ J_ 


a-t a-t AI 

a< A 
{a,7n) — l 


pOa) , 1 


a-t 


+ TT 


AI (j-l)A^-i’ 


we obtain 


r 1 , 

/ —dt 

I A 


E 

a> A 
(a,m) = l 






1 .c+EE^.c- V EM 

a< A 
(a,m) = l 






C - 1 

< -r—r + 


Af-i {j-l)AI 


1 


1 






C 1 

< —r + 




1 - 


1 1 


Ai-'^ J \A j-l 


AI-^ a-J 

a< A 
(a,m) = l 


- 1 
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Since (1 — \/A){\/A + 1) < 1 and 1/A + l/(j — 1) < 1 for j > 3, we obtain that 


1 - 


1 


1 1 


< 1 


Ai-^)\A^j-l 

for all integers i > 2 , and so the statement follows. □ 

We now obtain easily the estimates we want; by ( |C.13 1 and Lemma C.2.1 (with 
j = 2 and m = 1 ), 




q>r 




< 


q>r ab—q 

C(2)/C(4) 


f2ib) /r^(g) ^ f2ib) 

a q ~ ^ b 

b>l a>r/b 


6>1 


< 


6.7345 


(C.15) 


Similarly, by ( |C.13| l and Lemma [C.2.1 (with j = 2 and m = 2), 

y^ M^(g) ^ y^ f2{b) y^ P^{a) ^ C(2)/C(4) 1 


q>r 
q odd 


b ^' a 

b>l a>r/b 

b odd a odd 


12 1 


— < 
2 — 


1 + 1/22 ^ 


E/ 2 W 


b odd 


no 


p>2 


2p-l 

{P - 


< 


2.15502 


y^ M^(g) ^ y^ 
2 ^ (/(g)^ 2 ^ 


q'>r 
q even 


(?>r/2 
q odd 


y?{q) 4.31004 

(/)(g )2 - r 


(C.16) 


(C.17) 


Lastly, 

p\q)q 

(t>{q) 


E 

q<r 
q odd 


E ^ ( 7 ) E E E ^ ^ E 2 ^(d) ( 

q<r d\q ^ d<r ^ q<r d<r ^ 

d odd 


q<7 
q odd 


g<7 
d\q 
q odd 


d<r 
d odd 




i ^ E 17^ + ^ E S + ^ + 


d odd 


where we are using (C .81 and (C.ll 1 . 


(C.18) 


* * * 

Since we are on the subject of (j){q), let us also prove a simple lemma that we use 
at various points in the text to bound q/<p{q). 

Lemma C.2.2. For any q > 1 and any r > max(3, q). 
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where 


F (r) = log log r + 


2.50637 
log log r' 


(C.19) 


Proof. Since F (r) is increasing for r > 27, the statement follows immediately for 
q> 27 by ESMI Thm. 15]: 




< F{q) < Fir). 


For q < 27, it is clear that q/(j){q) < 2 • 3/(1 • 2) = 3. By the arithmetic/geometric 
mean inequality, F(t) > 2Ve'>'2.50637 > 3 for all t > e, and so the lemma holds for 
q<21. □ 
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Appendix D 

Checking small n by checking 
zeros of C,{s) 


In order to show that every odd number n < N is the sum of three primes, it is enough 
to show for some M < N that 

1. every even integer A < m < M can be written as the sum of two primes, 

2. the difference between any two consecutive primes < is at most M — A. 

(If we want to show that every odd number n < N is the sum of three odd primes, 
we just replace M — 4 by M — 6 in m .) The best known result of type Q is that 
of Oliveira e Silva, Herzog and Pardi ( llOeSHP14| , M = A ■ 10^®). As for Q, it was 
proven in IIHP13I for M = 4 • 10^® and N = 8.875694 • 10®° by a direct computation 
(valid even if we replace M — 4 by M — 6 in the statement of Q). 

Alternatively, one can establish results of type <13 by means of numerical verifica¬ 
tions of the Riemann hypothesis up to a certain height. This is a classical approach, 
followed in IIRS75I and IISch76L and later in IIRS03I : we will use the version of Q 
kindly provided by Ramare in BRamdll . We carry out this approach in full here, not 
because it is preferrable to IIHP13I - it is still based on computations, and it is slightly 
more indirect than IIHP13II - but simply to show that one can establish what we need 
by a different route. 

A numerical verification of the Riemann hypothesis up to a certain height consists 
simply in checking that all (non-trivial) zeroes z of the Riemann zeta function up to a 
height H (meaning; Sr H) lie on the critical line = 1/2. 

The height up to which the Riemann hypothesis has actually been fully verified is 
not a matter on which there is unanimity. The strongest claim in the literature is in 
IGD041 . which states that the first 10^® zeroes of the Riemann zeta function lie on the 
critical line 3 ?( 2 :) = 1/2. This corresponds to checking the Riemann hypothesis up to 
height H — 2.44599 • 10®^. It is unclear whether this computation was or could be 
easily made rigorous; as pointed out in BSD 101 p. 2398], it has not been replicated yet. 

Before BGD04I . the strongest results were those of the ZetaGrid distributed com¬ 
puting project led by S. Wedeniwski BWed03l : the method followed in it was more 


305 



























306 


APPENDIX D. CHECKING SMALE N BY CHECKING ZEROS OEC{S) 


traditional, and should allow rigorous verification involving interval arithmetic. Unfor¬ 
tunately, the results were never formally published. The statement that the ZetaGrid 
project verified the first 9 • 10^^ zeroes (corresponding to H = 2.419 • 10^^) is often 
quoted (e.g., BBomlOl p. 29]); this is the point to which the project had got by the 
time of Gourdon and Demichel’s announcement. Wedeniwski asserts in private com¬ 
munication that the project verified the first 10^^ zeroes, and that the computation was 
double-checked (by the same method). 

The strongest claim prior to ZetaGrid was that of van de Lune (H = 3.293 • 10®, 
first 10^® zeroes; unpublished). Recently, Platt IPlaal checked the first 1.1 • 10^^ ze¬ 
roes {H = 3.061 • 10^®) rigorously, following a method essentially based on that 
in HBooOhal . Note that BPlaal uses interval arithmetic, which is highly desirable for 
floating-point computations. 

Proposition D.0.3. Every odd integer 5 < n < Uq is the sum of three primes, where 

{ 5.90698 • 10®® if hGDOd'H is used (H = 2.44 • lO^®), 

6.15697 • 10^® if ZetaGrid results are used (H = 2.419 • 10^^), 
1.23163 • 1027 if ^Pl^ is used ( H = 3.061 • lO^®). 

Proof For n < 4 • 10^® -|- 3, this is immediate from IIOeSHP14l . Let 4 • 10^® -|- 3 < 
n < riQ. We need to show that there is a prime p in [n — 4 — (n — 4)/A,n — 4], 
where A is large enough for (n — 4)/A < 4 • 10^® — 4 to hold. We will then have that 
4:<n—p<4:+{n — 4)/A < 4 • 10^®. Since n — p is even, ||OeSHP14| will then 
imply that n — pis the sum of two primes p', p", and so 

n=p-\-p' +p". 

Since n — 4 > lO^^, the interval [n — 4 — (n — 4)/A, n — 4] with A = 28314000 
must contain a prime IIRS03I . This gives the solution for (n — 4) < 1.1325 • 10®®, since 
then (n - 4) < 4 • 10^® - 4. Note 1.1325 • 10®® > e®®. 

From here onwards, we use the tables in BRamdl to find acceptable values of A. 
Since n — 4 > e®®, we can choose 

{ 52211882224 if BGD04I is used (case (a)), 

13861486834 if ZetaGrid is used (case (b)), 

307779681 if BPlaal is used (case (c)). 

This gives us (n — 4)/A < 4 • 10^® — 4 for n — 4 < e’'°, where rg = 67 in case (a), 
rg = 66 in case (b) and rg = 62 in case (c). 

If n — 4 > e’'°, we can choose (again by BRamdl f 

{ 146869130682 in case (a), 

15392435100 in case (b), 

307908668 in case (c). 

This is enough for n — 4 < e®® in case (a), and without further conditions for (b) or (c). 
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Finally, if n — 4 > e®® and we are in case (a), IIRamdII assures us that the choice 

A = 147674531294 

is valid; we verify as well that (no — 4)/A < 4 • 10^® — 4. □ 

In other words, the rigorous results in IPlaal are enough to show the result for all 
odd n < 10^^. Of course, MHP13I is also more than enough, and gives stronger results 


than Prop. D.0.3 
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